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ABSTRACT Determination of degree of relationship traditionally has been undertaken using genotypic KEYWORDS 
data on individual loci, typically assumed to be independent. With dense marker data as now available, it is relationship 
possible to identify the regions of the genome shared identical by descent (ibd). This information can be identity-by- 
used to determine pedigree relationship (R), e.g., cousins vs. second cousins, and also to distinguish descent 
pedigrees that have the same Wright's relationship (R) such as half-sibs and uncle-nephew. We use simu- genomic identity 
lation to investigate the accuracy with which pedigree relationship can be inferred from genome sharing for likelihood 
uniparental relatives (a common ancestor on only one side of their pedigree), specifically the number, 
position (whether at chromosome ends), and length of shared regions ibd on each chromosome. Moments 
of the distribution of the likelihood ratio (including its expectation, the Kullback-Leibler distance) for alter- 
native relationships are estimated for model human genomes, with the ratio of the mean to the SD of the 
likelihood ratio providing a useful reference point. Two relationships differing in R can be readily distin- 
guished provided at least one has high R, e.g., approximately 98.5% correct assignment of cousins and half- 
cousins, but only approximately 75% for second cousins once removed and third cousins. Two relationships 
with the same R can be distinguished only if R is high, e.g., half-sibs and uncle-nephew, with probability of 
correct assignment being approximately 5/6. 



Relatives carry individual genes and also genomic regions identical by 
descent (ibd). In many situations in human, natural, or agricultural 
populations, it is important to identify relatives and, if possible, degree 
of relationship using this information. Traditionally, methods of 
identifying relatives have used information regarding identity in 
state (ibs) of individual genes (Weir 1996), with increasingly dense 
markers enabling increasingly high precision, using methods such as 
CERVUS (Marshall et al. 1998), components of PUNK (Purcell et al. 
2007), or COANCESTRY (Wang 2011). 

Traditionally, establishing relationships does not use information 
regarding location in the genome, and statistical properties are often 
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based on assuming unlinked markers. Linkage information can be 
incorporated, however, by using the linkage map and taking into 
account the Markovian nature of the ibd process underlying the 
genotypes of relatives at linked loci (Epstein et al. 2000; McPeek 
and Sun 2000; Kyriazopoulou-Panagiotopoulou et al. 2011) using 
methods such as RELPAIR (Epstein et al. 2000). Regions of the 
genome that are shared ibd can be established using identity in state 
{e.g., Abecasis et al. 2002; Roberson and Pevsner 2009) using pro- 
grams such as MERLIN (Abecasis et al. 2002). 

Alternatively, distantly related individuals can be identified from 
multilocus sharing of even quite small regions of the genome (Browning 
and Browning 2011, 2012, 2013). If it is known that two individuals are 
related, then the allelic information adds little on regions already clearly 
shared ibd as determined by common sequence (except perhaps on ibs 
of two very-low-frequency genes). Further, the use of information on 
shared regions rather than just individual loci allows, at least in prin- 
ciple, discrimination between relationships with the same Wright's 
relationship R but different pedigree R, e.g., uncle-nephew and half- 
sib, both of which have R = 0.25. R used here is strictly Wright's 
numerator relationship, which equals twice the kinship (coancestry), 
but it is the same as Wright's relationship in the absence of inbreed- 
ing. R defines the pedigree (Table 1). Further, the actual proportion 
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Table 1 Pedigree relationships (R), Wright's coefficient of relationship (R), and abbreviations used 



Relationship R 



Full-sib family-based R 



Half-sib family-based R 



Lineal R 



1/4 
1/8 

1/8 
1/16 

1/16 
1/32 

1/64 

1/128 



Uncle-nephew a UN Half-sibs 

Great-uncle-great-nephew GUGN Half-uncle-nephew 



Full cousins C 
Cousins once removed C1 R 



Second cousins 2C 

Second cousins once 2C1R 

removed 

Third cousins 3C 



Half-cousins 

Half-great-uncle-great-nephew b 
Half-cousins once removed 



HS 
HUN 



HC 



Grandparent-grandoffspring GPO 
Great-grandparent-great- GGPO 
grandoffspring 



GtGtgrandparent- 
GtGtgrandoffspring 



G3PO 



G4PO 



HC1R 

Half second cousins H2C 
Half second cousins once removed H2C1R Further generation as above G6PO 



GtGtGtgrandparent- 
GtGtGtgrandoffspring 
Further generation as above G5PO 



UN, uncle-nephew; HS, half-sib; GPO, grandparent-grandoffspring; GUGN, great-uncle-great-nephew; HUN, half-uncle-nephew; GGPO, great-grandparent-great- 
grandoffspring; C, cousin; C1R, cousin once removed; HC, half-cousin; Gt, great; G3PO, great-great-grandparent-great-great-grandoffspring; 2C, second cousin; 
HC1 R, half-cousin once removed; G4PO, great-great-great-grandparent-great-great-great-grandoffspring; 2C1 R, second cousin once removed; H2C, half second 
cousin; G5PO, great-great-great-great-grandparent-great-great-great-great-grandoffspring; 3C, third cousin; H2C1R, half second cousin once removed; 
G6PO, great-great-great-great-great-grandparent-great-great-great-great-great-grandoffspring. 
Including uncle/aunt-nephew/niece. 

Relationship not included in subsequent Tables as the distribution is identical to that for half-cousins. 
Similarly, other relationships with the same R and family base but different pedigree are not analyzed, e.g., (full) cousins twice removed = second cousins [R = 1/32), 
and half-cousins twice removed = half second cousins (R = 1/64). 



of the genome shared can, by chance, be higher by more distant (e.g., 
second cousins: R = 1/32) than closer relatives (e.g., first cousins 
once removed: R = 1/16). The proportion of overlap of the distri- 
bution of actual relationship increases as the relationship of each of 
the pairs becomes more distant (e.g., R = 1/64 vs. 1/32) (Hill and 
Weir 2011), further increasing the problem of determining the ped- 
igree R. 

The pedigree relationship may be needed in a number of situations. 
The estate of an individual who dies intestate may by law have to be 
divided among his or her closest relatives. Courts would assume this to 
be defined by pedigree. Another situation would be in identification of 
individuals in forensic cases, for example, in identifying a body or 
a body part in a disaster zone, or in familial searching for relatives of 
an offender already in the database (Rohlfs et al. 2012). In studies of 
natural populations, pedigree construction is an important component 
in determining breeding structure and estimation of genetic param- 
eters (Blouin 2003; Pemberton 2008). 

Detection of genomic regions for which there is biparental sharing, 
i.e., individuals with ibd genotype at each diploid locus due to re- 
lationship through both parents (e.g., full-sibs or double first cousins), 
is quite straightforward because there is ibs at each locus in the region. 
We consider here only the much more common situation of unipa- 
rental sharing, in which case R is half the probability relatives share 
one allele ibd at a locus, or half the expected proportion of uniparental 
genome shared. 

Therefore, a quantitative description of the number, position, and 
length of shared regions is all the information we can have about 
relationship of a pair of individuals in the absence of pedigree 
information, and this sets an upper limit to what we can detect. 
Our objective is to find what this limit is for different alternative 
pedigrees. Therefore, as a reference point, we work on the premise 
that we have precise estimates of these quantities but later consider 
this assumption. We also assume there are no confounding factors, 
such as inbreeding of the common ancestor or relationships among 
other ancestors of the pair. Data on gene frequencies and genotypes 
at individual loci then add no further information. 

We focus on identifying specified pedigree relationship from actual 
or realized genomic sharing, for example, whether a pair of individuals 



are related either as second cousins or as cousins once removed, in 
each case assuming there is uniparental sharing. Such comparisons 
can be undertaken based on a likelihood ratio, although the appropriate 
test or discrimination depends on the questions to be answered, such 
as the following. Which of two or more alternative relationships is 
the most probable? How sure are we? What relationships can we 
exclude? 

The variation in the total length can be computed (Hill and Weir 
2011) and there are also various methods and approximations for 
computing the numbers and distribution of the lengths of shared 
regions (Fisher 1954; Donnelly 1983; Stam 1980). Recently, Huff et al. 
(2011) have proposed methods to identify whether pairs of individuals 
taken from the population are related more closely, e.g., as second 
cousins, than background relationship among all population mem- 
bers from distant relationships in a finite closed population. 

There is no theory available that enables prediction of the numbers 
and distribution of shared segments exactiy for arbitrary relationships. 
Therefore, we use simulation to generate the required probability 
distributions. There are, however, approximations for some of these 
distributions available: for example, Huff et al. (2011) assumed 
a Poisson distribution of number and exponential distribution of 
shared regions (i.e., independence), and we also investigated their 
accuracy. We conclude with a discussion on inference. The primary 
objective was to set the theoretical framework and compute what can 
be achieved rather than focus on applications per se. 

MATERIALS AND METHODS 
Simulation 

The simulation program was used previously to check theoretical results 
for the variance of the length of shared regions on a chromosome (Hill 
and Weir 2011), which in turn provided a check on the program itself. 
Simulations were undertaken for a single chromosome, for example, 
of length / Morgans, in each independent replicate. There was as- 
sumed to be a uniform recombination rate and no interference, i.e., 
corresponding to a Haldane mapping function. The number of re- 
combination events was sampled from a Poisson distribution and 
their positions were sampled as real valued numbers independently 
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Table 2 Examples from 100,000 simulated replicates of the distribution of the numbers (n s ) of genomic segments shared by relatives 
on a chromosome of 1 .632 M 





1 IM 
UN 






C\ P 
U I r\ 


2C 


2C1R 


3C 


LIC 

no 


Ull IM 
HUN 


UrU 




R 


1/4 


1/8 


1/8 


1/16 


1/32 


1/64 


1/128 


1/4 


1/8 


1/4 


1/8 


n s 










Number of replicates 










0 


921 


17,696 


9976 


33,823 


55,541 


71,901 


83,006 


1826 


19,559 


9965 


28,052 


1 


14,948 


29,825 


29,729 


33,471 


27,620 


19,682 


12,902 


24,700 


34,913 


54,387 


43,923 


2 


35,379 


28,580 


32,644 


20,836 


1 1 ,902 


6317 


3180 


41,257 


28,856 


30,210 


22,223 


3 


31 ,041 


1 6,286 


19,168 


8677 


3789 


1647 


764 




1 2,655 


5046 


5149 


4 


13,556 


5940 


6719 


2526 


934 


362 


122 


6675 


3379 


386 


610 


5 


3501 


1408 


1515 


561 


184 


82 


22 


1150 


579 


6 


39 


6 


587 


237 


231 


95 


28 


8 


3 


101 


57 


0 


4 


7 


65 


26 


17 


11 


2 


1 


1 


6 


2 


0 


0 


8 


2 


2 


1 


0 


0 


0 


0 


1 


0 


0 


0 


9 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 



UN, uncle-nephew; GUGN, great-uncle-great-nephew; C, cousin; C1 R, cousin once removed; 2C, second cousin; 2C1R, second cousin once removed; 3C, third 
cousin; HS, half-sib; HUN, half-uncle-nephew; GPO, grandparent-offspring; GGPO, great-grandparent-great-grandoffspring. 



from the uniform distribution. All regions of ancestral chromosomes 
were labeled by the same integer value, e.g., 1, 2, and so on. Hence, 
a chromosome of a descendant was defined by the position (it) of 
each of the n — 1 recombination events, e.g., 0 = tt 0 < tti, tt 2 , ■ ■ ., 
tt„-i < tt„ = 1, denning the n chromosomal regions labeled hi, ■ ■ ., 
h n . Then, for example, n = 4,ir 1 = 0.1256, ir 2 = 0.5701, ir 3 = 0.9012, 
and hi = 1, h 2 = 2, h 3 = 1, h 4 = 3 denote a chromosome for which the 
first region (from 0.0 to 0.1256) and third region (from 0.5701 to 
0.9012) were derived from ancestor 1, the second was derived from 
ancestor 2, and the third was derived from ancestor 3, and thus ibd for 
that genomic region with these respective ancestors. This does not 
imply that the parent has a chromosome with exactly that haplotype, 
but that a gamete could be formed from it that does, i.e., the shared 
region may span grandpaternal and grandmaternal origins. Hence, for 
a second descendent of the same individuals with, for example, n = 2 
and tti = 0.3659, hi = 2, h 2 =1, there is sharing between the two 
descendents in two regions, between 0.1256 and 0.3659 from ancestor 
2 and between 0.5701 and 0.9012 from ancestor 1, i.e., internal regions 
of length 0.2403 and 0.3311, respectively, with a total proportion of 
0.5714. 

To obtain the results presented here, 100,000 or more independent 
replicates were performed. For each replicate the sharing among 
different kinds of relatives was computed, so for a founder full-sib 
family, the degree of sharing of, for example, uncle and nephew (or 
aunt and nephew, etc, because only autosomes were simulated), 
great-uncle and great-nephew, and cousins of degree up to third 
cousins were sampled successively. Although this induced sampling 
correlations, these were trivial because replicates were numerous and 
independent. Simulations were performed independently for chro- 
mosomes of different length and for three different founder relation- 
ships: linear descendants, full-sib-based, and half-sib-based (Table 1). 

Distribution of shared segments 

Numbers of shared segments: We provide examples to illustrate the 
kind of data available from the simulation for a map length of an 
"average" human chromosome of 1.632 M (based on Kong et al. 
2004). Table 2 shows the distribution of numbers of shared segments 
(n s ) for a range of relationships from a full-sib base and for a more 
limited number of half- sib-derived and lineal relationships. For this 
length of chromosome there is a less than 1% chance that uncle and 
nephew share no genome and approximately 15%, 35%, and 31% 



probability that they share 1, 2, and 3 regions, respectively. For half- 
sibs, who have the same Wright's relationship (1/4) as uncle-nephew, 
the probabilities are 2%, 25%, 41%, and 24%, respectively. Of course, 
more distant relatives share fewer and smaller regions. For longer 
chromosomes (in terms of map length or expected number of recom- 
binations) than shown in Table 2, the expected number of shared 
regions increases and length of individual segments decreases. 

Positions of shared segments: Information also can be obtained from 
position of the shared regions, specifically whether they include the 
chromosome ends. Examples of the distribution of shared regions on 
the chromosome according to their position, specifically whether they 
include both, one, or no ends of the chromosome (p s = 2, 1, 0, re- 
spectively), are shown in Table 3. A single region sharing both ends 
rarely occurs unless the relationship is close, and the proportion shar- 
ing at neither end increases as the relationship becomes more distant. 
Half-sibs are more likely to share regions including both chromosome 
ends than are uncle and nephew. 

Lengths of shared segments: The expected proportion of genome 
shared (i.e., 2R) is of course the same as the overall length of shared 
regions expressed as a proportion of the genome length, but the 
distribution of the lengths of the total and of individual shared seg- 
ments depends on the pedigree R. Examples are also given in Table 3 
for the mean and SD of the total length actually shared, expressed as 
a proportion of the chromosome length I = 1.632 M, as a function of 
whether the shared regions include zero, one, or two chromosome 
ends. A special case is when n s = 1 and p s = 2, when the length is 
invariant because the whole chromosome is shared. 

Summary of simulated statistics 

Because the numbers (n s ) and positions (p s ) are discrete valued var- 
iables, to facilitate subsequent analysis total length shared on the 
chromosome also was summarized in discrete values, namely as 
the number of tenths of the chromosome shared (t s ): if x (>0) is the 
length shared, then for f s = 1: 0 < x S 0.1, t s = 2: 0.1 < x < 0.2; . . .; f s = 
10: 0.9 < x < 1.0. The distribution of the length of individual shared 
segments conditional on the numbers, positions, and total length 
shared on each chromosome was not included in subsequent anal- 
yses because it contains no additional information. For example, if 
there are two shared segments of total length x, then the relative 
lengths y and x-y tell us nothing about the numbers of generations 
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Table 3 Examples using simulations as in Table 2 of the distribution of numbers (n s ), positions (p s ), mean, and SD of total length of 
genomic regions shared by relatives on a chromosome of 1.632 M 





Ps 




UN 






C 






2C 






HS 




M 


Mean 


en 
bD 


M 

In 


Mean 


bD 


M 

IN 


Mean 


bu 


M 


Mean 


bU 


0 




921 






9976 






55,541 






1826 






1 


2 


947 


1.000 


0.000 


43 


1.000 


0.000 


0 


— 


— 


1936 


1.000 


0.000 




1 


7041 


0.508 


0.288 


8888 


0.211 


0.196 


6007 


0.110 


0.110 


12,409 


0.502 


0.289 




0 


6960 


0.328 


0.237 


20,798 


0.162 


0.150 


21,613 


0.096 


0.095 


10,355 


0.336 


0.236 


2 


2 


7044 


0.675 


0.234 


1886 


0.407 


0.245 


177 


0.228 


0.172 


10,122 


0.665 


0.236 




1 


18,878 


0.500 


0.226 


14,514 


0.307 


0.190 


3704 


0.199 


0.137 


22,119 


0.501 


0.224 




0 


9457 


0.398 


0.202 


16,244 


0.260 


0.162 


8021 


0.177 


0.120 


9016 


0.402 


0.201 


3 


2 


9768 


0.605 


0.201 


2494 


0.435 


0.204 


148 


0.295 


0.1 86 


9152 


0.600 


0.200 




1 


15,811 


0.503 


0.191 


9837 


0.366 


0.170 


1500 


0.271 


0.145 


1 1 ,858 


0.497 


0.189 




0 


5462 


0.428 


0.178 


6837 


0.317 


0.155 


2141 


0.237 


0.124 


3274 


0.428 


0.176 


4 


2 


5467 


0.572 


0.177 


1411 


0.457 


0.182 


83 


0.405 


0.174 


3109 


0.569 


0.173 




1 


6515 


0.500 


0.169 


3486 


0.402 


0.161 


418 


0.336 


0.148 


2978 


0.501 


0.167 




0 


1574 


0.444 


0.158 


1822 


0.361 


0.147 


433 


0.294 


0.126 


588 


0.440 


0.159 


>4 


All 


4155 






1764 






214 






1258 







UN, uncle; C, cousin; 2C, second cousin; HS, half-sib. 

p s =2, 1, 0 denotes sharing at both, one, and neither end of the chromosome, respectively. 



apart. Although shown by simulation, on reflection it is obvious 
because the distribution is uniform. 

To simulate the 22 human autosomes, map lengths were simplified 
into five classes, based on the data of Kong et al. (2004), and pre- 
viously were used for illustration (Figure 5 of Hill and Weir, 2011), 
namely two chromosomes of 0.75 M, eight chromosomes of 1.25 M, 
six chromosomes of 1.75 M, four chromosomes of 2.1 M, and two 
chromosomes of 2.75 M, totaling 35.9 M. Simulation also was un- 
dertaken assuming 22 chromosomes each of 1.632 M, i.e., with the 
same average length as in the model using five lengths. As shown later, 
there is little difference in predictions of discriminating ability between 
the five-length and one-length models, so further subdivision of chro- 
mosome lengths to more closely match those for humans for analysis 
would have little impact on calculations or conclusions. This does not, 
however, imply that individual lengths should be ignored in analyses 
of real data. 

As inferred from Hill and Weir (2011), from variances of actual 
relationship and also from simulations, for half-sib-based relation- 
ships the distribution of shared regions (« s , p s , and f s ) depends only 
on Wright's numerator relationship R. For example, it is the same for 
half-cousins and half-great-uncle-great-nephew relationships (both 
R = 1/16), and for half second cousins and half-cousins twice removed 
(R = 1/64). Similarly, for full-sib-based relationships, the distribution 
is the same for second cousins and first cousins twice removed (R = 1/ 
32), but is not the same for great-uncle-great-nephew and cousins 
(R = 1/8). 

Likelihood ratios 

Computation: Let k denote a specific realization {n s , p s , fj of genome 
sharing on a specified chromosome, and let P R (k) denote the proba- 
bility of this outcome dependent on the chromosome length and 
conditional on the relationship being R (e.g. half-sibs). If, for example, 
only information on n s is used, then the realization is simply {nj. The 
contribution provided by the observation k to the log likelihood ratio 
A (A : B) for relationships A and B is then logP A (fc) — logP B (fc) using 
the logarithm inter alia because it has better sampling properties. We 
use the simulation results to obtain these probabilities, computed 
simply as the proportion of replicates with the appropriate outcome. 



Thus, using only data on n s , for example, and assuming three shared 
segments, then A (UN : HS) ~ ln(0.3 10/0.243) = 0.245. and A (UN : 
GUGN) ~ ln(1.91) = 0.645 (Table 2). Because segregation over chro- 
mosomes is independent, the total log likelihood ratio is obtained by 
summing contributions to the log likelihood ratio from different chro- 
mosomes, using probabilities appropriate to the map length and re- 
alization for each chromosome. If there is previous information 
regarding the relationships from nongenetic data and these can be 
quantified, then Bayes theorem can be used straightforwardly to com- 
pute posterior probabilities of alternative relationships. Otherwise, 
application is context-dependent, and we discuss that subsequently. 

Moments: Although any testing is situation-specific, we can in- 
vestigate the properties of the log likelihood ratio as a function of the 
data used and possible relationships to be compared. Thus, we 
consider its moments, specifically its mean and variance. If the real 
relationship is A, then the contribution to the mean from a single 
chromosome is as follows in equation 1: 

E A [A(A : B)] = % k P A (k)[logP A (k) - logft (*;)], 

and there is an equivalent formula for the variance. The overall 
mean and variance of A are obtained by summing contributions over 
chromosomes. We also compute its skew and kurtosis. 

The mean A is the (directed) Kullback-Leibler distance between the 
two distributions P A and P B (Kullback and Leibler 1951; Burnham 
and Anderson 2001). This "distance" is not symmetric, i.e., in general, 
E A [A(A : B)] E B [A(B : A)]. Subsequently, we tabulate values over 
the correct distribution (i.e., real relationship) such that they are 
positive. 

Because the numbers of shared segments and their positions are 
count data and because lengths shared were analyzed similarly as 
discrete variables, the numbers in each defined class k have a multi- 
nomial distribution with parameters estimated from the simulation 
results. In computing the moments of A, the expected probabilities 
P R (k) were assumed to have been estimated by simulation with neg- 
ligible error. If the estimate from simulation of P A (k) was not zero but 
that of P B (fc) was zero, then in computing the term P A (k) [logP A (fc) - 
logP B (fc)], it was assumed that P B (fc) = 1/(2N), where N was the 
number of replicates simulated. This term becomes important only 
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2C1R 




2C 



1 



CI R 




0.0 1 .0 2.0 
(versus C1R) 



0.0 1 .0 2.0 
(versus 2C) 



0.0 1 .0 2.0 
(versus 2C1R) 



Figure 1 Cumulative contribution of sources of information to 
expected log likelihood ratio relative to its SD. E(A)/SD(A): number of 
segments only (n s left), additional contribution from fitting position of 
segments (p s center), and additional contribution from fitting length 
of segments (t s , right). (Upper) Sets of closely related pairs for real 
relationship uncle-nephew (UN) or cousin (C). (Lower) Sets of more 
distantly related pairs for cousins once removed (C1 R), second cousins 
(2C), and second cousins once removed (2C1 R). 



when the distributions differ greatly [in which case E(A) is already 
large] and when expected numbers in cells become very small. To 
reduce errors such as this due to simulation, because data regarding 
numbers of segments itself included data regarding lengths, results 
given utilizing « Sj p s , and t s used all three for 1 < n s < 4, but only 
n s and p s for n s > 4. 



The ability to discriminate between alternative relationships using 
the likelihood ratio depends on the distribution of A, mainly on the 
relative sizes of its mean and SD, so we tabulate E(A)/SD(A). Because 
there is replication of observations across chromosomes, SDs were 
computed over the aggregate, and therefore might be regarded as 
standard errors, but we retain the SD notation. We also found that 
\ typically has close-to-normal form. 

RESULTS 

Moments of log likelihood ratios 

Expectation: Information available for contrasting relationships 
expressed as expected log likelihood ratios [E(A), Kullback-Leibler 
distances] are provided in the upper part of Table 4 for a subset of 
relationships using the full simulated data for numbers (n s ), posi- 
tions (p s ), and lengths (f s ) of shared segments. In these and sub- 
sequent tables, rows denote the real relationship and columns denote 
the hypothesized relationship. Values of E(A) for all 19 relationships 
analyzed and incorporating successively more information are given 
in Appendix Table Al (using n s only), Table A2 (using n s and p s ), 
and Table A3 (using n s , p s , and t s , i.e., as in Table 4). In all these 
Tables, values were computed from simulation runs for each of the 
designated five map lengths (0.75, 1.25. 1.75, 2.10, and 2.75 M), each 
replicated 100,000 times, i.e., as weighted averages over a total of 
500,000 replicates. 

It was seen that E(A) is small when relationships are distant and of 
similar magnitude (Table 4, upper part), e.g., second cousins and half- 
cousins once removed (for both of which R = 1/32). Although Kull- 
back-Leibler distances are not symmetric, the reciprocal cases here are 
usually close but not identical in value, so only half the pairs of assumed 
relationships are included in Table 4 (but all are in the Appendix 
Tables). E(A) is typically higher when the likelihood ratio is condi- 
tional on the higher relationship of the two, presumably because 
there is a wider distribution of numbers and lengths of segments 
shared among close relatives and therefore there is more informa- 
tion in the data. 

The increment in E(A) by incorporating position and length can be 
substantial for comparisons involving quite closely related individuals 
(Appendix Table Al, Table A2, and Table A3). As they become 
distant, e.g., half-cousins vs. third cousins, the absolute and propor- 
tional increase is small. First, few shared segments are at the ends of 
chromosomes and the coefficient of variation in length shared decreases 
as the number of segments shared increases. 

Expectation vs. sampling error 

SDs of A values using all information (« s , p s , t s ) are given in Table 4 
(lower) for a number of relationships. Examples of E(A)/SD(A) for two 
subsets of relationships, one including pairs of high relationships 
(1/16 ^ R £ 1/4 in Table 5) and the other including pairs of more 
distant relationship (1/128 S R < 1/16 in Table 6). Later, we discuss 
the interpretation of these values and show that the ratio is, at least 
approximately, a noncentrality parameter determining the probability 
of misassignment. Approximately, a value of 2.0 or more indicates 
a pair of relationships that can be distinguished with reasonable con- 
fidence. Full data fitting different amounts of information are given for 
SD(A) in Appendix Table A4, Table A5, Table A6 and for E(A)/SD(A) 
in Appendix Table A7, Table A8, and Table A9. It is seen that SD(A) 
tends to increase along with E(A) as relationships become more dif- 
ferent, e.g., uncle-nephew vs. half-sib and vs. cousin (Table 5), and 
therefore E(A)/SD(A) diverges less rapidly than E(A). 
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Table 4 Expected log likelihood ratio, E(A) (upper), and its SD, SD(A) (lower), for a subset of relationships using information on numbers 
(n 5 ), positions (p s ), and total lengths (t s ) of shared segments 
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0.00 
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14.85 
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26.08 


64.08 


60.25 


61.21 


110.08 
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105.92 


HS 


1.92 


0.00 


13.79 


16.24 


11.98 


16.23 


64.27 


58.20 


55.02 


107.60 


103.87 


101.06 


GPO 


20.50 


11.08 


0.00 


32.67 


19.68 


9.94 


72.61 


63.35 


53.23 


105.94 


101.19 


97.15 


C 


14.50 


13.00 


26.49 


0.00 


2.52 


12.58 


19.46 


20.21 


24.32 


51.41 


50.67 


51.94 


HUN 


18.91 


13.58 


14.86 


2.52 


0.00 


3.63 


17.39 


15.71 


16.27 


45.22 


43.35 


42.63 
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19.29 
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20.89 


17.01 
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44.29 


41.59 
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4.56 
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9.35 


8.93 
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11.87 


11.52 


HS 
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5.81 


6.19 


4.51 


5.11 


11.47 


10.27 


9.14 


13.26 


12.88 


12.50 


GPO 


5.34 


4.16 
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7.20 


4.77 
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14.18 
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17.51 


16.92 


16.78 
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5.17 


4.43 


7.21 
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5.51 


6.18 


5.97 


6.71 


10.43 


10.02 


9.99 


HUN 


6.71 


5.40 


4.93 


2.23 




2.99 


7.04 


6.23 


6.04 


11.56 


10.96 


10.58 


GGPO 


7.64 


6.49 


3.57 


4.17 


2.26 




8.75 


7.54 


6.28 


12.91 


12.21 


11.51 



UN, uncle-nephew; HS, half-sib; GPO, grandparent-offspring; C, cousin; HUN, half-uncle-nephew; GGPO, great-grandparent-great-grandoffspring; 2C, second 
cousin; HC1R, half-cousin once removed; G4PO, great-great-great-grandparent-great-great-great-grandoffspring; 3C, third cousin; H2C1R, half second cousin once 
removed; G6PO, great-great-great-great-great-grandparent-great-great-great-great-great-grandoffspring. 

Lengths utilized only up to n s = 4 assuming a model human autosomal genome comprising 2 chromosomes of length 0.75 M, 8 chromosomes of length 1.25 M, 6 
chromosomes of length 1.75 M, 4 chromosomes of length 2.1 M, and 2 chromosomes of length 2.75 M. Rows denote the real relationship (A), columns denote the 
hypothesized relationship (B), and elements are E A [A(A : B)] (equation 1). 

Contributions from segment position and length 

The contributions of different components of the data to E(A)/SD(A) 
are illustrated in Figure 1 for some of the relationships in Table 5 and 
Table 6. It shows the ratio fitting only numbers of shared segments 
and shows the increments in the ratio by fitting positions and then 
lengths. A high proportion [in some cases almost all the information 
as judged by E(A)/SD(A)] is contained in the number of shared seg- 
ments. A litde more is added by including position, but only for close 
relationships when chromosomes ends are likely to be shared (Table 3). 
More information is obtained by incorporating length of chromosome 
shared, although not with a clearly defined pattern over relationships. 

Approximating likelihoods 

Equal chromosome lengths: To facilitate analysis of the distribution 
of log likelihood ratios, we consider a computational simplification, 
namely assuming all chromosomes have the same length rather than 
ranging over five different lengths. Hence, data also were simulated 



using a larger number of replicates (300,000) for chromosomes of 
length 1.632 M, the mean of those simulated previously, and likeli- 
hood ratios computed for genomes with 22 such chromosomes. Very 
similar values of E(A), SD(A), and, consequently, E(A)/SD(A) as those 
in Appendix Tables Al through A9 were obtained. Results in Appen- 
dix Table A10 for E(A)/SD(A) enable comparison direcdy with those 
in Appendix Table A9 computed using the five chromosome lengths 
model. In summary, of the 342 off-diagonal comparisons of E(A)/SD(A) 
for the 19 relationships, only 32 deviated by more than 2% and of these 
32, E(A)/SD(A) exceeded 1.0 in only 11, i.e., large proportional differ- 
ences typically occurred when absolute differences were small. 

Replication: Because differences in moments of A ascribed to different 
models can arise from differences in expectation and from sampling in 
the simulation, a further run of 300,000 replicates for chromosomes of 
length 1.632 M as in Appendix Table A10 was undertaken (results not 
shown). The differences in E(A)/SD(A) between the replicates were 
very small; of the 342 off-diagonal comparisons, only 1 1 differed by 



Table 5 Ratio of expected log likelihood ratio to its SD, E(A)/SD(A), using information on numbers, positions, and lengths of shared 
segments, as in Table 4: Sets of closely to moderately related pairs 
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4.77 
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2.14 
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1.86 


0.66 




0.73 




G3PO 


5.18 


4.60 


4.19 


2.31 


2.85 


1.90 


1.49 


1.43 


0.78 




UN, uncle- 


-nephew; HS, half 


-sib; GPO, grandparent-offspring; GUGN, great-uncle-great-nephew; C, cousin; 


HUN, half- 


uncle-nephew; 


G2PO, great-grandparent- 


great-grandoffspring; C1R, cousin once 


removed; HC, half-cousin; 


G3PO, great-great-grandparent-great-great-grandoffspring. 
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Table 6 Ratio of expected log likelihood ratio to its SD, E(A)/SD(A), using information on numbers, positions, and lengths of shared 
segments as in Table 5: Sets of more distantly related pairs 
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more than 2%, and of those E(A)/SD(A) exceeded 1.0 in only 5. The 
main results (e.g., Table 4, Table 5, Table 6 and corresponding Ap- 
pendix Tables) computed for five lengths of chromosome involved 
a total of 500,000 unequally weighted runs, rather than 300,000 
equally weighted runs (as we performed), so we conclude that suf- 
ficient replication was used. 

Higher moments and distributions of log 
likelihood ratios 

To simplify calculations, and in view of these results showing a good 
approximation of likelihood statistics computed for a model of 
chromosomes of equal length as that for chromosomes of different 
lengths, higher moments and distribution of A were computed as- 
suming all chromosomes had length 1.632 M (from simulations as in 
Appendix Table A 10). Coefficients of skew and kurtosis are given in 
Appendix Table All and Table A12, respectively, for a subset of 
relationships. In general, both coefficients are small, indicating close- 
ness to a normal distribution. The kurtosis coefficient is generally 
smaller than the skew, and kurtosis tends to be seen only in the 
presence of skew. The largest skew generally is found when the 
true relationship is weak and the assumed relationship is stronger, 
in which case there is negative skew. Positive skew is found less 
often, but typically when the assumed relationship is weaker than 
the true relationship. The apparent near-normality is not unex- 
pected because each sample is of size 22 and the central limit 
theorem applies (as it would to results simulated for samples from 
chromosomes of five different lengths). Examples of the distribu- 
tion of the log likelihood ratio, scaled as A/SD(A), are given in 
Figure 2, showing near-"normal" form as anticipated in these par- 
ticular examples. 

Approximations to sampling distributions 

The results we have used have been based entirely on simulation. We 
investigate, however, theoretical results available that could be used to 
obtain some more simply computed but potentially less informative 
tests of pedigree relationship. 

Based on work by Thomas et al. (1994), Huff et al. (2011) give 
an expression for the expected number of shared segments in the genome 



that, for a single chromosome, becomes the following equation 
(equation 2): 

E{n s ) =a(dl+l) 

where a is the number of ancestors (1 for half-sib mating, 2 for full- 
sib mating), d is the total number of meioses separating ancestors 
and descendants (back to the grandparents), and / is the map length. 
For lineal descendents, numbers of shared segments are typically 
one-half those of half-sib descendents, and the expected number 
shared with the grandparent (or founder of a recurrent backcross 
line) is [{d — 1)1 + 1] ( Vz) (rf— 15 , where d is the number of meioses 
back to the grandparent (i.e., founder, hence terms in d — 1 be- 
cause recombination to the parent is irrelevant). Thus, for example, 
R = 1/16 and d = 4 for full-sib-based (cousins once removed), 
half-sib-based (half cousins), and lineal descendents (great-great 
grandparent-great-great grandoffspring). The formulae do not ap- 
ply to the cases of uncle-nephew, for which (surmised from sim- 
ulations as in Table 2) E(« s ) = (5/ + 2)/4, or great-uncle-great-nephew, 
for which E(n s ) = (71 + 2)/8. The mean numbers of shared seg- 
ments from simulation agree (within sampling error) with predic- 
tion (Table 7). 

Huff et al. (2011) also state that given d, the expected length of 
a shared segment is lid, based on the calculations of length surround- 
ing a specific marker (Fisher 1949). They assume independence of 
numbers and length, implying from equation 2 that the expected total 
length of a chromosome shared is (dl + l)(ViY d ~ l Vd for half-sib 
descendents. However, because the expected proportion of the ge- 
nome shared is 2R = (V4)( d-1 \ the mean total length shared is actually 
l(V2y d ~ l \ It is partitioned over the expected number (dl + 
of shared segments and, therefore, taking into account the finite length 
of the chromosome, the expected length of an individual segment is 
II (dl + 1) = ll(d + III), not lid. These equations also hold for full-sib 
and lineal descendants. For example, for a chromosome of length 
1.632 M, the expected lengths of a shared segment are 0.383 M, 
0.277 M, and 0.217 M for half-sibs, half-uncle, and half-cousins, re- 
spectively, rather than 0.5 M, 0.333 M, and 0.25 M without the cor- 
rection. The proportionate difference becomes smaller for more 
distant relatives, e.g., 0.151 M rather than 0.167 M for half second 
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Figure 2 Distribution of log likelihood ratio, expressed relative to its 
SD, A/SD(A) using information on numbers, positions, and lengths of 
shared segments for examples of alternative pedigree relationships, 
e.g., real relationship uncle-nephew hypothesized relationship half-sib 
(UN-HS). C, cousin; HC, half-cousin; 2C, second cousin; 3C, third 
cousin; HC1R, half-cousin once removed; GUGN, great-uncle-great- 
nephew; G2PO, great-grandparent-great-grandoffspring. Model of 22 
chromosomes, each of 1.632 M. 

cousins. For uncle-nephew and great-uncle-great-nephew, the expected 
lengths of individual segments on a chromosome are, from simulation, 
2/(5/ + 2) and 2/(7/ + 2), respectively. 

Huff et at (2011) also made the simplifying assumption that the 
number of shared segments is Poisson-distributed, implying Var(« s ) = 
E(« s ) on individual chromosomes and the whole genome, but simu- 
lations show departures between mean and variance (Table 7). For 
a chromosome of length / = 1.632 M, the actual distribution is rather 
less dispersed than the Poisson for close relatives, but slightly more 
dispersed for more distant relatives. For cousins, for example, E(« s ) = 
1.882, V(« s ) = 1.311, and the proportion sharing no segments is ~10% 
(Table 2), but the Poisson expectation is ~15%. Further, the distribu- 
tion of shared segment lengths was assumed by Huff et al. to be 
independently exponentially distributed, in which case the coefficient 
of variation (CV) of the total length of shared segments on a chromo- 
some would be proportional to l/yjn s . For close relations who may 
share a high proportion of the chromosome, the actual distribution is 
substantially underdispersed compared with the Poisson and the CV 
of total length shared deviates from the l/\Jn s prediction. As relation- 
ships get more distant, these predictions hold better. 

Using approximate sampling distributions to 
distinguish relationships 

Because the predicted numbers (» s ) of shared segments (Huff et al. 
201 1) have the correct mean, they provide a simple route to likelihood 
calculations without simulations. Further, as illustrated in Figure 1, 
most of the information can be obtained from the numbers of shared 
segments without using their positions and length. As the actual dis- 
tribution departs from the Poisson (Table 7), however, there would be 
some reduction in discriminating power in computation of likeli- 
hoods, even from number of segments shared alone. To investigate 
this, we computed the log likelihood ratio for alternative types of 



relationships using data only regarding n s assuming it is Poisson- 
distributed, and we computed its mean and SD using the actual 
frequency distribution obtained from simulation. For simplicity, we 
assumed 22 chromosomes each of length 1.632 M. Examples are 
given in Table 8 for the log likelihood ratio computed using both 
the Poisson and the actual distributions. 

The log likelihood ratios remain zero when the real and assumed 
relationships are the same. In general, E(A) is smaller when the Pois- 
son approximation is used, but the proportional reduction is incon- 
sistent. There are cases when it is larger, which seems illogical, but 
there is no guarantee A decreases because the test is against a false 
hypothesis, with the actual distribution fitting closer to the Poisson 
with the wrong parameters. Because the SD is also substantially af- 
fected and typically is smaller, the ratio E(A)/SD(A) is often larger than 
that computed using the correct distribution obtained by simulation, 
but the pattern is not consistent. In view of this, such approximations 
should be used with care, and in any case we have provided an exact 
approach (strictly, more nearly exact, from replicate simulations). 

Extension to other species: impact of chromosome 
number and length 

Results have been given for a model human genome of c = 22 auto- 
somes with a total map length of L = 35.9 M; however, to assess how 
they need modifying for other species, we consider how c and L in- 
fluence results. We have shown that a model of 22 chromosomes of 
equal average length (1.632 M) approximates that with lengths rang- 
ing from 0.75 M to 2.75 M, with most in mid range. Therefore, if 
chromosomes have similar mean length to those of humans and the 
distribution of lengths is no more dispersed, moments for different 
numbers of chromosomes can be predicted well by scaling as E(A) « c 
and E(A)/SD(A) « ^jc because they are independent. To investigate the 
impact of wider variation in length we considered alternatives with 
total genome length 36 M, comprising 72 chromosomes each of 0.5 M 
or 12 chromosomes each of 3 M. 

Ability to discriminate, expressed in terms of E(X)/SD(A), is given 
for some examples of relationship in Appendix Table A13 using either 
numbers of segments alone or all sources, i.e., numbers, positions, and 
lengths. In summary, when there are many independent chromo- 
somes, E(A)/SD(A) is generally higher than when there are few, par- 
ticularly when Wright's relationship R differs, because probabilities of 
ibd at individual loci are mosdy uncorrelated with many small chro- 
mosomes. Independent loci do not provide evidence to distinguish 
relationships such as uncle-nephew and half-sibs having the same R. 
Information is contained in the distribution of number and length of 
shared segments, however, and the differences in E(A)/SD(A) between 
the 12 and 72 chromosome models for the same total map length are 
small, although generally higher for c = 72 when comparing relation- 
ships with different R. For relationships with the same R there is neg- 
ligible difference, e.g., real relationship half-sib, assumed uncle-nephew, 
E(A)/SD(A) = 0.97 for 72 chromosomes, 0.98 for 12 (Appendix Table 
A13) chromosomes, and 0.98 for 22 variable-length chromosomes 
(Table 5). Overall, therefore, the discriminating power clearly depends 
more on total amount of genome rather than on the individual chro- 
mosome lengths for the typical range of lengths in mammals. 

DISCUSSION 
Inference 

Although likelihood ratios are a natural way to describe the plausibility 
of alternative relationships, how to draw inferences from them is less 
clear-cut. Let CI denote the set of all pedigree relationships R under 
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Table 7 Mean and variance of number of shared segments on a chromosome of 1.632 M for different relationships obtained 
by simulation (300,000 replicates) 3 
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FS-based 


Mean 


Variance 


l_IC U, J 

no-Daseu 


Mean 


Variance 


Lineal 


Mean 


Variance 


1/4 


UN 


2.539 


1.170 


HS 


2.135 


0.942 
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0.537 
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1.482 


HUN 
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1.187 


GGPO 


1.066 


0.767 
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1.885 


1.311 














1/16 


C1R 


1.147 


1.213 


HC 


0.941 


1.020 


G3PO 


0.736 


0.721 


1/32 


2C 


0.674 


0.845 


HC1R 


0.572 


0.714 


G4PO 


0.471 


0.543 


1/64 


2C1R 


0.387 


0.519 


H2C 


0.336 


0.445 


G5PO 


0.287 


0.358 


1/128 


3C 


0.219 


0.299 


H2C1R 


0.193 


0.258 


G6PO 


0.169 


0.217 



FS, full-sib; HS, half-sib; UN, uncle-nephew; GPO, grandparent-offspring; GUGN, great-uncle-great-nephew; HUN, half-uncle-nephew; GGPO, great-grandparent- 
great-grandoffspring; C, cousin; C1R, cousin once removed; HC, half-cousin; G, great; G3PO, great-great-grandparent-great-great-grandoffspring; 2C, second 
cousin; HC1 R, half-cousin once removed; G4PO, great-great-great-grandparent-great-great-great-grandoffspring; 2C1R, second cousin once removed; H2C, half 
second cousin; G5PO, great-great-great-great-grandparent-great-great-great-great-grandoffspring; 3C, third cousin; H2C1R, half second cousin once removed; 
G6PO, great-great-great-great-great-grandparent-great-great-great-great-grandoffspring. 

The mean number of shared segments from simulation agreed very closely with those expected from the formula. Of these 19 items, only three differences 

exceeded 0.001 and none exceeded 0.003. 



consideration. Because this is a finite set of discrete elements, it 
removes some of the difficulties in assigning prior probabilities 
when, typically, these are neither specified nor easy to specify. Bayes 
theorem can then be used to combine likelihoods and prior proba- 
bilities to produce a posterior distribution over the elements of ft. 
Unless some form of ordering, or measure of distance, is introduced 
in O, it is impossible to speak of means or variances of this distri- 
bution, but it will usually have a unique mode, and the correspond- 
ing relationship R will be our "best guess" at the true relationship. 
A confidence set could be obtained by ordering relationships by 
posterior probability and dropping relationships with the smallest 
probabilities until a desired probability level is achieved for the 
remainder. 

Without prior probabilities, everything hangs on the likelihood. 
The likelihood function is defined on ft, and the relationship R in ft 
that produces the maximum value of the likelihood is the maximum 
likelihood estimate of the true relationship, corresponding to the pos- 
terior mode with a uniform prior. Without a distance measure, and 
with discrete relationship classes, standard asymptotic results for max- 
imum likelihood estimates are not available. The distribution of the 
maximum likelihood estimate could be calculated by simulation, how- 
ever, assuming any particular R to be true. 

Any particular R can be tested as a null hypothesis against the 
general alternative that the true relationship is not R by using a max- 
imum likelihood ratio test (McPeek and Sun, 2000). The set of those R 
in ft for which this test is not significant at a given significance level 
constitutes a confidence set for the unknown relationship. McPeek 
and Sun (2000, p. 1079) point out that although the sampling distri- 
bution of log likelihood ratios for two fixed relationships is often close 
to a normal distribution (as we have shown previously; Figure 2, 
Appendix Tables All and A12), the sampling distribution of the 
maximized version tends to be skewed (the difference is between 
the estimate of R fixed or varying from sample to sample). Neverthe- 
less, even in the normal case, simulation is required to obtain the 
mean and variance of the null distribution. 

An issue that arises with both Bayesian and likelihood approaches 
is the completeness or otherwise of ft. The true relationship might be 
one we neglected to consider; it might be bilinear, but not so detected 
{e.g., paternal half-sibs and maternal second cousins), or an ancestor 
might be inbred so the probabilities of ibd sharing of descendents 
differ from those assumed here. Some relationships could be excluded 
based, for example, on ages of the individuals concerned, e.g., some 
lineal or avuncular relationships. 



If all that is required is to identify the "best guess" among all 
relationships under consideration, then we select the relationship with 
the largest likelihood, or the largest posterior probability. This can be 
regarded as a discrimination problem, with the relationships treated 
symmetrically. The two solutions correspond to the maximum likeli- 
hood or Bayes discriminate rules (Mardia et al. 1979), and the per- 
formance of such rules is judged by the set of misclassification 
probabilities. 

Discriminating between two relationships amounts to choosing 
one if the log likelihood ratio A > 0 and choosing the other if A < 0. 
If the two relationships are A and B, and the distribution of A is 
normal in each case, then the misclassification probabilities are 
<t>{ — m B /s B ) when we choose A, and $(— m A /s A ) when we choose 
B, where m A = E A [A(A : B)], i.e., the mean of A when A is the true 
relationship, and m B = E B [A(B : A)] (= — E B [A(A : B)]) when B is the 
true relationship and s A and s B are the corresponding SDs. Ratios of 
mis for various pairs of relationship are in Table 5 and Table 6, with 
more in Appendix Tables A7-A9. 

As an example, let us assume X dies intestate and a search locates 
one living relative, indisputably a half-cousin. Subsequently, Y appears 
claiming to be a cousin of X (but otherwise unrelated to Y), and thus is 
more closely related. Given only DNA data, can the claim be 
substantiated or disproved? There are two competing hypotheses: 
for A, Y is a cousin of X; and for B, Y is a half-cousin of X. To keep 
this argument simple, we discount other possible relationships. Given 
a prior probability that X and Y are cousins, the Bayesian approach 
provides a posterior probability, but it is not clear what a reasonable 
prior probability would be in the absence of any background infor- 
mation for Y. With the likelihood approach, we can clearly discrim- 
inate with confidence in this situation because both misclassification 
probabilities are small, -~1.5% using Table 5, <t>( — 2.22) ~ 0.013, 
i.e., if we decide half-cousins, and — 2.14) ~ 0.016 if we decide 
cousins. 

Taking as a simple criterion a difference of 2 SD in log likelihood 
ratio as an indicator of discriminating ability (corresponding to 
a misclassification probability of approximately 0.02), it is seen that 
although it is possible to distinguish between a distant and a close 
relationship with high power, it is more difficult between relationships 
of the same degree (R), increasingly so as R becomes smaller (Table 5 
and Table 6). There is littie power to discriminate between relation- 
ships for which R is 1/64 or less; for example, the probability of correct 
assignment (based simply on sign of the log likelihood ratio) is ap- 
proximately 3/4 for second cousins once removed vs. third cousins as 
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Table 8 Expected log likelihood ratio, E(A), and ratio to its SD, E(A)/SD(A), using data from numbers of shared segments (n s ) only 
computed from simulated data and also from the Poisson assumption, but with weights as for the simulated (actual) data 







Simulated 


Poisson 


Simulated 


Poisson 


Simulated Poisson 


E 


E/SD 


E 


E/SD 


E 


E/SD 


E 


E/SD 


E 


E/SD E 


E/SD 


R 


1/4 






UN 








HS 






GPO 




1/4 


UN 


0.00 





0.00 


, 


1.82 


0.90 


0.81 


0.91 


26.54 


2.43 9.81 


2.94 


1/8 


GUGN 


11.01 


1.86 


3.66 


1.54 


6.28 


1.41 


1.15 


0.84 


8.36 


1.25 1.01 


0.73 


1/8 


C 


5.64 


1.42 


2.05 


1.27 


2.30 


0.89 


0.33 


0.49 


9.05 


1.33 2.38 


1.24 


R 


1/8 






C 








HUN 






GGPO 




1/8 


C 


0.00 




0.00 




1.52 


0.87 


1.16 


0.89 


8.42 


1.76 5.62 


1 .84 


1/16 


C1 R 


5.75 


1.62 


3.68 


1.43 


1.38 


0.80 


0.87 


0.66 


1.29 


0.66 0.07 


0.19 


1/32 


2C 


16.22 


2.93 


11.35 


2.56 


7.73 


2.02 


6.00 


1.78 


3.90 


1.39 1.83 


0.92 


R 


1/32 






2C 








HC1R 






G4PO 




1/32 


2C 


0.00 




0.00 




0.15 


0.26 


0.19 


0.27 


0.76 


0.54 0.86 


0.54 


1/64 


2C1R 


1.35 


0.88 


1.59 


0.85 


0.66 


0.60 


0.75 


0.56 


0.29 


0.38 0.17 


0.26 


1/128 


3C 


3.91 


1.64 


4.59 


1.58 


2.70 


1.33 


3.13 


1.26 


6.00 


2.04 5.54 


1.78 


UN, uncle- 


-nephew; HS, h 


alf-sib; GPO, grandparent-offspring; GUGN, great-uncle-great-nephew; C, cousin; 


C1 R, cousin 


once rem 


oved; 2C, second 


cousin; 2C1 R, 



second cousin once removed; 3C, third cousin. 

Model of 22 chromosomes, each of 1.632 M. Actual relationships are in rows. Hypothesized relationships are in columns. 



E(A)/SD(A) ~ 0.6. It is easier to distinguish lineal relationships, e.g., 
great-great-great-grandparent-offspring from second cousins, than it 
is to distinguish second cousins from half-cousins once removed (for 
all of which R = 1/32) because the lineal recombination and trans- 
mission process differs more than that between half and full-sib 
descendants. 

Without use of information as shown here regarding shared 
genomic regions and merely considering resemblance locus by locus, 
relationships such as uncle-nephew and half-sib cannot be distin- 
guished at all. It is seen that E(A)/SD(A) ~ 1, whichever relationship 
is the real one. Hence, the likelihood ratio will be in the correct di- 
rection approximately 5/6 of the time — not certainty at a level looked 
for in significance tests, but not valueless. For more distant pairs with 
the same R, the probability of correct assignment will decline; for 
second cousins and half-cousins once removed (R = 1/32), the prob- 
ability declines to approximately 2/3. This illustrates the limitations of 
making decisions about the relationship between a pair of individuals 
even if based on full genomic data. 

Assumptions 

Many assumptions have been made in this analysis. The first is that 
the number of shared segments is accurately recorded, and the main 
risk is that short segments are missed. In population studies, Brown- 
ing and Browning (2013) and S.R. Browning (personal communica- 
tion) report good power to detect segments of 1.5 cM and higher 
using dense SNP data and 1 cM or higher with sequence data. For 
exponentially distributed segment lengths of expected length a (cM), 
this would imply a probability of missing an individual segment of 
approximately 1.5/a (lla) from SNP (sequence) data. For half-cousins, 
for example, the expected segment length is 21.7 cM for a chromosome 
of 1.632 M (see Results regarding approximations to sampling distri- 
butions), implying an approximately 7% chance of missing a random 
segment using SNPs, slightly less for closer relatives or using sequence 
data. Thus, there would be bias towards underestimating both 
Wright's and pedigree relationship, but little in comparing relatives 
with the same R. For known relatives, however, as considered here, the 
probability would be expected to be much lower because the individ- 
uals are already identified as relatives and not trawled from the pop- 
ulation. Errors in estimating segment length would be comparatively 
unimportant (Figure 1). 



Errors therefore will not necessarily lead to wrong assignment but 
to miscalculation of the likelihood ratios. As Table 2 and Table 3 
show, however, the pattern of numbers shared is unlikely to change 
greatly if the error rate is no more than a few percent, and the relative 
parameters for different relationships will remain approximately the 
same. A detailed analysis of consequences of errors is beyond the 
scope of this article, however. 

Further assumptions made when information on chromosome 
length is included are that a Haldane mapping function is appropriate 
and that map length can be accurately inferred from physical length 
of the chromosome. We consider the number of segments and the 
probability that shared segments reach chromosome ends would 
depend little or not at all on the mapping function. Problems might be 
encountered in measuring the segment length distribution, converted 
to map units, before using the data and methods presented here. If 
there are major experimental technical problems in measuring lengths 
or concern about the mapping functions or conversion from physical 
length, then that information could just be ignored with, for most 
pairings of R, little impact on discriminating ability (Figure 1). 

We also have taken no account of distant background relationship, 
assuming all genome sharing was due to recent common ancestry, 
whereas Huff et al. (2011) did so. Such sharing will bias predictions 
towards higher relationship. As in the example here, an extra rather 
than lost shared segment on one or two chromosomes will have little 
effect on likelihood calculations for fairly close relationships. Propor- 
tional errors become larger as relationships become more distant, but 
as results such as in Table 6 show, the power to discriminate among 
quite distant relationships is low in any case. 

General conclusions 

The results presented here show what can, in theory, be achieved 
in determining pedigree relationships from information on genome 
sharing. No further information is, in principle, available from 
analysis at the individual locus level (except perhaps from sequenc- 
ing and tracing point mutations in the pedigree). The low levels of 
expected likelihood ratios compared with their sampling error for 
pairs of quite distant relationships illustrate both how much variability 
in actual relationship in terms of shared genome comes from random 
Mendelian segregation and linkage and the consequent difficulty in 
assigning relationship. 



1562 I W. G. Hill and I. M. S. White 



"~5tG3'Genes I Genomes | Genetics 



ACKNOWLEDGEMENTS 

We thank Bruce Weir for helpful comments and discussion and 
reviewers for their useful criticisms. This work was supported by 
grants from the Leverhulme Trust to William G. Hill and from 
National Institutes of Health (GM 099568) to Bruce Weir, University 
of Washington, and by USS. 

LITERATURE CITED 

Abecasis, G. R., S. S. Cherny, W. O. Cookson, and L. R. Cardon, 

2002 Merlin - rapid analysis of dense genetic maps using sparse gene 

flow trees. Nat. Genet. 30: 97-101. 
Blouin, M. S., 2003 DNA-based methods for pedigree reconstruction and 

kinship analysis in natural populations. Trends Ecol. Evol. 18: 503-511. 
Browning, B. L., and S. R. Browning, 2013 Improving the accuracy and 

efficiency of identity-by-descent detection in population data. Genetics 

194: 459-471. 

Browning, S. R., and B. L. Browning, 2011 A fast, powerful method for 
detecting identity by descent. Am. J. Hum. Genet. 88: 173-182. 

Browning, S. R., and B. L. Browning, 2012 Identity by descent between 
distant relatives: detection and applications. Annu. Rev. Genet. 46: 617- 
633. 

Burnham, K. P., and D. R. Anderson, 2001 Kullback-Leibler information as 
a basis for strong inference in ecological studies. Wildl. Res. 28: 111-119. 

Donnelly, K. P., 1983 The probability that related individuals share some 
section of the genome identical by descent. Theor. Popul. Biol. 23: 34-64. 

Fisher, R. A., 1949 The theory of inbreeding, Oliver and Boyd, London. 

Fisher, R. A., 1954 A fuller theory of 'Junctions' in inbreeding. Heredity 8: 
187-197. 

Epstein, E., W. L. Duren, and M. Boenkhe, 2000 Improved inference of 
relationships for pairs of individuals. Am. J. Hum. Genet. 67: 1219-1231. 

Hill, W. G, and B. S. Weir, 2011 Variation in actual relationship as a con- 
sequence of Mendelian sampling and linkage. Genet. Res. 93: 47-74. 

Huff, C. D., D. J. Witherspoon, T. S. Simonson, J. C. Xing, W. S. Wa et al, 
2011 Maximum-likelihood estimation of recent shared ancestry 
(ERSA). Genome Res. 21: 768-774. 



Kong, X., K. Murphy, T. Raj, C. He, P. S. White et al, 2004 A combined 
physical-linkage map of the human genome. Am. J. Hum. Genet. 75: 
1143-1148. 

Kullback, S., and R. A. Leibler, 1951 On information and sufficiency Ann. 

Math. Stat. 22: 79-86. 
Kyriazopoulou-Panagiotopoulou, S., D. K. Haghighi, S. J. Aerni, A. Sundquist, 

S. Bercovici et al, 2011 Reconstruction of genealogical relationships with 

applications to Phase III of HapMap. Bioinformatics 27: 1333— 1341. 
Mardia, K. V., J. T. Kent, and J. M. Bibby, 1979 Multivariate Analysis, 

Academic Press, London. 
Marshall, T. C, J. Slate, L. E. B. Kruuk, and J. M. Pemberton, 

1998 Statistical confidence for likelihood-based paternity inference in 

natural populations. Mol. Ecol. 7: 639-655. 
McPeek, M. S., and L. Sun, 2000 Statistical tests for detection of misspecified 

relationships by use of genome-screen data. Am. J. Hum. Genet. 66: 1076-1094. 
Pemberton, J. M., 2008 Wild pedigrees: the way forward. Proc. Biol. Sci. 

275: 613-621. 

Purcell, S., B. Neale, K. Todd-Brown, L. Thomas, M. A. R. Ferreira et al, 
2007 PLINK: a toolset for whole-genome association and population- 
based linkage analysis. Am. J. Hum. Genet. 81: 559-575. 

Roberson, E. D. O., and J. Pevsner, 2009 Visualization of shared genomic 
regions and meiotic recombination in high-density SNP data. PLoS ONE 
4: e6711. 

Rohlfs, R. V., S. M. Fullerton, and B. S. Weir, 2012 Familial identification: 
population structure and relationship distinguishability. PLoS Genet. 8: 
el002469. 

Stam, P., 1980 The distribution of the fraction of the genome identical by 

descent in finite populations. Genet. Res. 35: 131-155. 
Thomas, A., M. H. Skolnick, and C. M. Lewis, 1994 Genomic mismatch 

scanning in pedigrees. IMA J. Math. Appl. Med. Biol. 11: 1-16. 
Wang, J., 2011 COANCESTRY: A program for simulating, estimating and 

analysing relatedness and inbreeding coefficients. Mol. Ecol. Resources 

11: 141-145. 

Weir, B. S., 1996 Genetic data analysis. II, Sinauer, Sunderland, MA. 

Communicating editor: D.-J. De Koning 



'-5iG3'Genes | Genomes | Genetics 



Volume 3 September 2013 I Identifying Pedigree Relationship I 1563 



<3 



U 



co 


o 


st 
o 


LO 
<> 


CO 

st 


CM 

o 


•o 
o 


CO 

o 


CO 


LO 




CM 

q 


o 

CO 


CO 

r-. 


O 


o 


CM 


CO 

O 


o 
o 


ro 


d 


"* 




st 


CM 


CM 


c> 




c> 




LO 


CO 






d 


d 


d 


d 


lo 
CO 


LO 


CO 
CM 


LO 
LO 


st 


o 


CO 


o 
st 


CO 


CO 


L> 


CO 

o 


LO 


LO 
CM 


st 


ro 

CO 


CO 

o 


o 
o 


CO 

o 


-o 
o 


st 

LO 


CM 
CO 


CO 
CO 


st 


CO 
CM 


CO 


r-v 


CM 


CO 


LO 




CM 




d 


d 


d 


d 


d 


CO 
CO 


CM 
CO 




CO 

c> 


o 

CO 


CM 
CO 


CM 
CM 




LO 

q 


CO 


CM 

q 


CO 
CO 


CO 


o 

CO 


CO 
st 


CM 
CM 


o 
q 


CO 

q 


o 




LO 


CO 


CO 


CO 


CM 


«~ 


























CN 

CO 


CO 


CO 


CO 
CO 


-o 


CO 
CO 


st 

CO 


st 


CO 


CO 
C> 


o 

CO 


c> 


rv 
o- 


o 

CO 


O 

q 


o 
q 


o 

CM 


LO 
CO 


CO 
LO 


o 

LO 


•o 
st 


st 
CM 


r — 

CM 


st 

CO 


CM 
CM 


CO 


CM 


CO 


si - 


CO 




o 


o 


o 


o 


o 


o 


o 


cm 

CN 


LO 


CO 


CO 


st 


o 


CM 


CO 




CO 


CO 
CM 


CM 


LO 
LO 


q 


o 
q 


00 

q 


CO 

st 


st 


o 

C> 


LO 


o 
st 


CM 
CM 


CM 
CM 


o 

CM 


CO 




o 




CO 


CM 




o 


o 


o 


o 


o 


o 


o 


LO 


o 


CO 


CM 

oq 


CO 


CM 


»q 


CO 




o 

CO 


O 
LO 




o 

CM 


o 
q 


q 


LO 
CM 




st 

q 


-o 

CO 


st 


CO 




T_ 


CM 






























cm 

LO 


o 

CO 


LO 


o 

00 


LO 
CO 


st 
o 


LO 


LO 
LO 


CO 
LO 


CM 




CO 
CM 


o 
o 


ro 

CM 


CM 
LO 


00 


O 
CO 


ro 


o 

LO 


st 


CO 
CO 


<- 




CM 


CO 






CO 




o 


o 


o 


o 


o 


o 




CM 


CM 


LO 

q 


CO 

o 


CO 


CO 
00 


5 


LO 
CO 


q 


q 


CO 




LO 


o 
q 


o 

CM 




CO 

q 


CM 


CM 


00 


CO 

-q 


CO 


CM 


>- 


»- 


«- 






























Ln 


LO 
CO 


CO 

o 


CO 

c> 


LO 


o 
st 


o 


o 

LO 


q 


CM 

>* 


o 
q 




CO 

o 


LO 

CO 


L> 


O 


5- 


st 
--J; 


q 


CO 


CM 


*- 




2- 






























CO 

st 


st 

~q 


st 

CO 




CM 

•o 


CO 

**? 


c> 


o 

CO 


o 


o 
q 


CO 

sr 


LO 


LO 

CO 


-o 


o 

CO 


CM 
CM 


LO 

a- 


o 

LO 


LO 
CM 


CO 
CO 


CM 
CM 




o 


st 


•o 




CM 


d 


d 


d 


d 




CM 


CO 


st 


LO 


o 




q 


st 

CO 


CO 
CM 


c> 


CM 


CM 

o 




5; 


o 
q 


LO 


o 
c> 


CM 
CO 


CO 

l> 


LO 


st 


CO 
LO 


LO 
St 


o 


LO 
L> 


CM 
CM 


st 


O 


st 


CO 


CO 




o 


o 


o 


o 




CM 


St 


LO 


o 


CO 


o 


o 




o 

LO 


st 

O 


st 


CM 

o- 


CO 


q 


o 
q 


o 

CO 






CN 


O 
CM 




CO 
CO 


<) 


CM 
CO 


o 


CO 
LO 


































' - 




<- 


LO 


LO 

o 


O 
CO 


O0 
LO 


o 

CO 


o 
st 


o 
q 


o 

CM 


CM 


LO 

q 


CO 


o 


CM 


CM 
CO 


CO 

-q 


o 
q 


CM 




CM 


st 

CM] 


st 


CM 


LO 


CO 


CM 


d 






CM 


CO 


LO 




o 


o 


CM 


st 


LO 




CO 

o 


o 


CO 
O 


o 
st 


CO 

st 


o 
o 


CO 
GO 


5; 


CO 




o 


CO 




LO 

q 


CO 


CM 
st 


o 

st 


o 

st 


■q 


s- 




























*~ 


,— 


CM 


CM 


CM 


o 


CO 
<3 


CO 

o 




o 
o 


CM 
LO 


CO 
CO 


o 


CM 


CO 
LO 


LO 

o 


CM 

o 


o 

CM 


o 


LO 


CO 


CO 

o 


-O 
CO 




st 




LO 


d 


d 






LO 


C> 


CO 




C> 


CM 
CM 


CM 


CO 
CM 


d 

CO 


st 
CO 


LO 

CO 


CO 


st 

LO 


o 


CO 


o 
o 


CO 
LO 


CM 

st 


CO 

-q 


o 

CO 


-o 


CM 
C> 






CM 


o 


CO 


o 

LO 


st 
LO 


st 






























«~ 


«~ 


CM 


CM 


CM 


CM 


LO 
CM 


■O 


o 
o 


st 


o 

CM 


st 
o 


o 


o 

LO 


o 

CM 


O 
LO 


LO 


CO 


o 


sr 
-o 


LO 


00 


CO 


st 
LO 


L> 


st 
CM 


* 


d 




CO 


st 


CM 




CO 


d 


sf 




d 

CM 


St 

CM 


<) 

CM 


ro 

CM 


CO 
CO 


st 
CO 


LO 

CO 


Ln 

CO 


o 
o 


o 


o 


•o 

CO 


o 


o 


LO 

c> 


CO 


LO 


o 


LO 


CO 
LO 




CO 
C> 


LO 
CM 


O 


CO 


st 




d 


c> 


<> 


CM 


CO 






CM 


CO 


<i 

CO 


5- 


<) 


CM 
LO 


LO 
LO 


c> 

LO 


st 
O 


-o 
o 


CO 


o 
o 


o 


CM 
CM 


CO 

o 


CO 


CO 

st 


st 
st 


st 

LO 


o 

CO 


CM 

q 


■o 

CO 


o 
ro 


o 

LO 


O 


o 


o 

CO 


O 


CM 


CO 
CO 


o 






d 


LO 


st 


<> 

CM 


<> 

CM 


CO 


CO 

«* 


CO 


CO 
LO 


c> 

LO 


-O 


c> 


CO 

[\ 


C> 


ro 


CO 
CO 



CU 



-C 0 ^ 



O ^ 

00 Q- 3 
IOOU 



.O O kOo: O ^ O 
■ Q- oi. , . o_ j- a. ,- U o- (Jo. 

lcM^L)CO(l(Jst(JCMLO(lCM<l 

:OUIOcMXOcMXlDmXC) 



1564 I W. G. Hill and I. M. S. White 



"~it:G3'Genes | Genomes | Genetics 



o 



o 



u 



o 



o 


ro 
O 


o 


o 




o 
o 


LO 

rM 


ro 
ro 


sf 


CM 


CO 
CO 


C> 


d 


d 


CO 


ro 


CO 
CM 


d 
rM 


LO 




ro 
LO 


o 
o 




CM 
O 


CM 


CM 

o 


ro 
CM 


CM 


CO 

•q 


O 
ro 


LO 
CO 




CO 

-o 


CO 
ro 


LO 


ro 


cm 


CO 


ro 


d 


O 
°^ 


o 
-o 


CO 

<> 


ro 




O 


ro 
c> 


CO 


CO 
<> 


ro 
co 


CO 






CO 




ro 


CM 








LD 


CO 
CO 


ro 


c> 


o 
rv 


st- 
op 


o 
-q 


o 


CO 

•q 


o 

CO 


o 


o 
-o 


si" 

lo 


CO 
CM 


LO 

ro 


CM 


CO 


CM 


CO 


LO 


CO 

q 


>* 

ro 


o 


LO 




CO 
ro 


o 
c> 


O 
■* 


ro 


CM 


LO 
O 


CO 
LO 


lo 

LT) 


LO 
CM 


ro 


CM 
CM 




o 


rv 


LO 


O" 


o 




ST 
LO 


o- 


ro 
c> 


rM 

LO 


o 

CO 




r-; 




LO 




CM 


CM 


CM 










CO 


o 

CO 




ro 
ro 


q 




CO 


CM 
<> 


LO 


O 


LO 


CM 


ro 


CO 


rM 




o 




ro 




o 




O 
CO 


CO 


o 


LO 

CO 


LO 
LO 


O 

q 


ro 


LO 


LO 


ro 


O 

ro 




CO 


<- 


o 




Osl 




O 


uo 


L> 




o 
q 


LO 
LO 


o 


LO 


ro 

LO 


O 
LO 




ro 




<- 




«- 










o 

LD 


O 
MD 


CO 
CM 




o 

CO 


o 


ro 
q 


CM 

q 


c> 


o 
q 


LO 

ro 


-o 

CM 


CM 


d 




O 


ro 


ro 


d 


d 


o 

O- 


CM 


CO 

•q 


o 


CM 


O 
MO 


-q 


LO 
LO 


o 
q 


LO 


<) 
CM 


CM 
CM 


LO 

r%i 


LO 


CO 


ro 


ro 


d 


d 


d 


ro 
CO 


LO 


LO 

c> 


ro 


CO 
CO 


O 
c> 


-o 
q 


o 
q 


r%i 

LO 


ro 


CM 


CM 


cm 
















CO 




O 

LO 


c> 


CO 


CM 


o 
q 


ro 




O 


CM 








d 


ro 


o 




Osl 


CM 


ro 


O 


-O 

q 


o 

LO 


a- 


O 
q 


ro 

LO 


CO 

■* 


LO 


r^- 
O- 






^! 






° 




CM 


ro 


LO 


rv 


ro 


o 
ro 


GO 


o 
o 


Lf) 

o- 


O 

LO 


LO 

c> 


LO 


-o 
-o 






r,i 
r%i 


d 


d 




CO 


LO 


L> 


ro 


o 


o 


OnI 


o 
o 


CM 


CM 
LO 


CO 


ro 
q 


O 
CO 


CM 
C> 


CO 


CO 


CO 


d 


d 


d 


sr 


ro 


sr 





O ^ ro ■<— O 



O ro 
^ ~ O 



o 
d 



o o o o 



CM l\ 

d d 



CO CNJ LO CO 

t- ^ ^ ro ro 



c\j n n o 
co ^ o ro 

CO 0 CN LO i IN 



o 



O CO 

co ro 



O CNJ <J N O 



CNCNLO^— O CM CO r- OO^ 

rorocorococNcocororo 



OONOOCOCOOCONCMOL^'t 



o -o o 



CNNCOO'tCOlilCO 
CMOJrO^'^-^LOLO 







o 


O 
O 


d 


d 


d 


d 


>* 


ro 
q 


o 
q 


q 


d 


d 


d 


d 


ro 


o 
q 


ro 
q 


ro 










O 
q 


o 
ro 


o 
>* 


LO 
LO 


d 


d 


d 


d 


o 






o 


d 


d 


d 


d 


rM 
ro 


CO 


LO 

q 


ro 










CM 

c> 


q 


ro 


CO 
<> 


o 


CM 


CM 


CM] 


ro 
<> 


CO 


LO 
CM 






CM 


ro 


ro 


CM 
<> 


C> 


O 


r%i 
q 










o 
■* 


O 


o 
o 


r%i 

LO 






-o 




■q 


CO 


CO 
CO 


q 




CO 


o 


o 


CM 
<> 


<5 
C> 


>* 


LO 








«~ 


CO 
C> 


CO 
t> 


-o 


CO 
ro 


CM 


LO 






O 
!\ 


LO 

q 


CO 

o- 


LO 




CM 


CM 






CO 

ro 


LO 


■* 

CO 


d 
ro 


ro 


Lf) 

ro 


-d 
ro 


CO 


LO 

q 


q 


ro 
q 


CM 




CM 


CM 


LO 


ro 


o 

LO 




CM 
ro 


ro 


ro 


CO 
ro 


ro 
CO 


O 

LO 


O 
CM 


CO 


d 
o 


<j 
O 


CO 


c> 




O 

LO 


o 


CM 




d 

CO 


CM 
CO 


CO 



6 o 2 



:9 £ 



O d 

00 D- Z) 

' I O O U 



O O kOii: O 

o- o; , , o- r; a- ^ U a. U o. 

CM^CJrOf)CJ^f()CMLO{JCMO 

OUIONlO«I13mIO 



- ° 



G3*Genes | Genomes | Genetics 



Volume 3 September 201 3 I Identifying Pedigree Relationship I 1565 



O cS 

Q_ 



co O <> ^~ 
in in ^ cm {> 



O O O LO 



in in 

n co 

in 06 

o o 



U 



X ^ <- ^ 



u 



o 

vO O CM 



CM CM 



o Mn o r- ^ 

<— O O LO 



O o - 

o_ co <- 

LO (> 

O co r-~ 



co >j -o o 



N CO m CM (N 



<— -o 



in cm <tf co co 

CO LO \ — t — LO 
UcONO^-Lrir^CNi 
CMcOOOOOOCOOOOO 



O <- CM CO 00 cm 

Q_ CM O CM CM CO 

^J" ^- LO CO C> 

O -O LO LO <— CM 



C£ LO O LO t— 

<— CM CM 00 IN CM 

^ d oo pi "O d 

X O LO -O t- CM 



CO N ! — O 

O CM O CM 

^ CN N 

O no IN <— 



00 CO 
IN O 



O JT « 

Q. CO IN 

CO o CM 

ID Nt co 



O -O O IN 

O 00 CM 

d -d >j 



in o in o lo 00 o 
(Jloooooo^j-o*— 

I in n 6 in cd ^' in 

CO 00 Nt 

„ MO ^ 
IN O O 



o. o 

CM <j 
O CM 



CM O IN LO vO O 



O 

o 



O q ^ co 

Z) CO CO LO 
(J - ^ CM 



O 
co 

I in d 



O O CO 
O O LO 



00 CO 00 o 



O CM O 
O O LO 

o ^ o 



o o 

in o. D 
X O O CJ 



o 


CO 


ro 




CM 


o 




Osl 


CM 


CO 


O 


O 


!\ 


in 


ro 




CO 


CM 


CO 






CM 


q 


q 


O 


-~6 






LO 




' — 


^~ 


d 


d 


d 


d 


C\l 




*~ 






















o 


Osl 








o 








Q 


CO 


CO 


ro 






CO 




Osj 


00 




o 


o 


o 




<) 


LO 


<j 








d 


d 


d 


d 


d 






' — 




















00 




LO 


o 


CO 


o 




o 


o 


o 




LO 


o 




Osl 


<) 


CO 




o 




o 


o 


o 


CM 




<) 




LO 






d 


d 


d 


d 


d 


d 






























CO 


LO 


o 






o 


o 


CM 


o 


o 


CO 


CN 






o 


CM 


o 


Osl 


o 


O 


LO 


o 






























o 


LO 








o 


CO 


CM 




Osl 


CO 


<) 


ro 


Osl 


■Nt 


■nJ- 




o 








o 


























o 


CO 


O 


Osl 


o 


o 


o 


o 


o 


CO 


o 




CM 


a- 


LO 




LO 


q 


q 






CO 


q 


5; 


o 


CO 


Qn 






CM 




o 










o 


CO 


O 


ro 


o 


O 


ro 




LO 






o 


CO 


in 


Osl 


O 




q 


ro 


q 


q 




r». 


c> 


«o 


CO 


osj 


■ 


o 




■ 






CM 


CM 


Osj 


CM 


CO 


o 




o 


CO 


0M 


ro 


LO 






CO 




in 


o 


ro 


q 






Osj 


<) 


CM 


«st 






CSI 




o 


















LO 






o 


LO 






ro 


o 


CM 


CO 


ro 




t> 


LO 


q 


CO 






c> 


<) 




LO 


q 


CO 


CM 














CM 






LO 


r-. 


ro 


o 


LO 


CO 


o 




O 


CO 


CO 




CO 


CM 


C\l 


o 


o 


o 


CO 


o 


CO 


o 




CO 


o 








ro 


CM 




LO 




LO 






CO 


CO 


o 


o 


Osl 


o 


CM 




ro 


o 


o 


CO 




CO 


o 


q 




CM 


q 






!\ 


LO 


CO 


ro 


o 


d 




Osj 


csi 


CO 


LO 


<) 


<) 


c> 


0< 


d 


o 






O 


LO 


CO 




LO 


O 


o 


CO 




o 


CO 


LO 


CO 




'nT 




LO 


^O 


CO 


o 


<3 


o 


d 


ro 


Osj 


CO 


LO 




CO 


C> 


CM 


CM 


ro 






















*~ 


,— 




Osl 


O 






CO 


Osl 




O 




o 






O; 












ro 




CM 


«st 




co 




ro 


d 




c> 






N^ 


CO 


CO 


CO 


CO 


o 


O 


o 




LO 


Osl 


Osl 




CM 






LO 


ro 




ro 


CO 


q 






o 


o 










<) 






CO 


CO 


CO 


c> 


CO 




«st 




















CM 


CM 


Osl 


uo 


o 


ro 




CM 


o 




o 


CO 


CM 


CM 






ro 




CO 


CO 




00 




CO 






ro 




O 


ro 


r^- 


o 


CM 


r-N, 


0^ 


o 


LO 


o 


r^- 


























o 


O 


LO 


^_ 


LO 


LO 


CO 


o 






CM 


^_ 




CM 


c> 




o 






Osl 




CO 


O 


o 




LO 




Osj 


CO 






d 




LO 


ui 


<) 








^ 








Osl 


CM 


CM 


CM 


Osl 


o 


CO 


o 


CO 






Osl 


ro 




o 


CM 






ro 






CO 










CO 






o 


Osl 






o 


o 


o 


o 




o 


o 


■ 

x 








ro 




















O 


' 


ro 


CO 






o 


LO 


CO 


LO 


CO 




LO 






CO 






CO 




<) 


LO 






ro 




CO 


o 


CM 




Osl 


*=st 


o 




Osl 


CO 


ro 


ro 




LO 


LO 




■o 










ro 






o 


o 


CO 


LO 


CO 


CM 


O 


o 




LO 


Osl 


r-N, 


o 


o 


O 




CO 


O 


LO 


o 


r^. 


CO 


Osl 










• 

"nT 


LO 


r^. 


' : n1" 


LO 


■o 


CO 






LO 


-o 


nO 


l\ 






CO 


CO 


CO 






o 






O 






o 




cc 


o 




u 


Q_ 






Q_ 




O 


Q_ 




U 


Q_ 




ro 


O 


u 




U 


Osl 


LO 


U 


CM 




U 


X 


O 


Osl 


X 




Osl 


X 


o 


CO 


X 


o 




















CO 






o 






Osl 












CM 












ro 











































CO 



:9 £ 



• •£ ° 



1566 I W. G. Hill and I. M. S. White 



"~5cG3'Genes | Genomes | Genetics 



Appendix Table A4 SD(A) as Table A1 using information only on numbers of shared segments 



R 






1/4 






1/8 






1/16 






1/32 






1/64 






1/128 






R 


UN 


HS 


GPO GUGN 


C 


HUN 


G2PO 


C1R 


HC 


G3PO 


2C 


HC1R 


G4PO 


2C1R 


H2C 


G5PO 


3C 


H2C1R G6PO 


1/4 


UN 


0.00 


2.02 


8.10 


2.75 


2.43 


3.77 


6.93 


4.23 


5.05 


6.98 


5.55 


6.13 


7.50 


6.64 


7.08 


8.11 


7.65 


7.97 


8.73 




HS 


1.74 


0.00 


5.43 


2.02 


1.47 


2.49 


4.86 


3.29 


3.93 


5.39 


4.65 


5.11 


6.15 


5.78 


6.14 


6.94 


6.77 


7.04 


7.69 




GPO 


4.87 


3.74 


0.00 


2.53 


2.51 


2.13 


1.85 


2.67 


2.89 


3.16 


3.68 


3.95 


4.30 


4.77 


5.02 


5.36 


5.83 


6.03 


6.36 


1/8 


GUGN 


5.87 


4.40 


4.87 


0.00 


1.24 


1.07 


4.10 


2.18 


3.18 


5.02 


4.37 


5.06 


6.35 


6.20 


6.73 


7.71 


7.85 


8.26 


9.03 




C 


3.98 


2.58 


5.19 


0.99 


0.00 


1.70 


4.64 


2.85 


3.73 


5.47 


4.77 


5.39 


6.61 


6.37 


6.85 


7.78 


7.81 


8.17 


8.91 




HUN 


6.34 


4.87 


3.29 


0.85 


1.76 


0.00 


2.56 


1.55 


2.36 


3.78 


3.64 


4.23 


5.24 


5.42 


5.88 


6.67 


7.01 


7.37 


8.02 




G2PO 


7.51 


6.32 


2.69 


2.15 


3.12 


1.68 


0.00 


1.28 


1.47 


1.91 


2.59 


3.00 


3.52 


4.17 


4.53 


5.01 


5.68 


5.97 


6.40 


1/16 


C1R 


8.68 


7.25 


4.03 


2.16 


3.55 


1.73 


1.84 


0.00 


0.93 


2.44 


2.30 


2.93 


3.96 


4.23 


4.73 


5.52 


5.99 


6.37 


7.02 




HC 


9.45 


8.15 


4.45 


2.91 


4.34 


2.55 


1.62 


0.86 


0.00 


1.31 


1.43 


2.00 


2.86 


3.32 


3.78 


4.45 


5.05 


5.41 


5.98 




G3PO 


9.63 


8.55 


5.01 


3.44 


4.82 


3.20 


2.05 


1.65 


0.96 


0.00 


0.90 


1.24 


1.71 


2.45 


2.83 


3.29 


4.05 


4.36 


4.77 


1/32 


2C 


10.48 


9.33 


5.73 


4.06 


5.51 


3.82 


2.78 


2.14 


1.38 


0.95 


0.00 


0.56 


1.37 


1.81 


2.25 


2.86 


3.48 


3.83 


4.33 




HC1R 


10.56 


9.51 


6.08 


4.35 


5.78 


4.17 


3.16 


2.55 


1.83 


1.21 


0.52 


0.00 


0.73 


1.23 


1.63 


2.16 


2.83 


3.15 


3.59 




G4PO 


10.32 


9.42 


6.34 


4.48 


5.84 


4.37 


3.50 


2.85 


2.21 


1.59 


1.01 


0.58 


0.00 


0.77 


1.08 


1.46 


2.20 


2.49 


2.83 


1/64 


2C1R 


10.50 


9.63 


6.66 


4.90 


6.21 


4.79 


3.97 


3.34 


2.72 


2.17 


1.54 


1.10 


0.73 


0.00 


0.36 


0.83 


1.43 


1.71 


2.08 




H2C 


10.30 


9.50 


6.73 


4.96 


6.21 


4.88 


4.13 


3.49 


2.91 


2.40 


1.78 


1.37 


0.98 


0.33 


0.00 


0.44 


1.03 


1.30 


1.63 




G5PO 


9.93 


9.23 


6.74 


4.92 


6.10 


4.87 


4.23 


3.57 


3.04 


2.59 


1.99 


1.62 


1.24 


0.66 


0.37 


0.00 


0.69 


0.91 


1.17 


1/128 


3C 


9.59 


8.95 


6.68 


5.03 


6.11 


4.99 


4.41 


3.81 


3.33 


2.93 


2.38 


2.04 


1.71 


1.15 


0.88 


0.61 


0.00 


0.24 


0.52 




H2C1R 


9.33 


8.74 


6.61 


4.98 


6.02 


4.96 


4.43 


3.84 


3.38 


3.01 


2.47 


2.15 


1.85 


1.30 


1.05 


0.79 


0.22 


0.00 


0.27 




G6PO 


8.96 


8.43 


6.49 


4.87 


5.85 


4.86 


4.40 


3.80 


3.39 


3.06 


2.53 


2.23 


1.96 


1.44 


1.20 


0.95 


0.43 


0.24 


0.00 



UN, uncle-nephew; HS, half-sib; GPO, grandparent-grandoffspring; GUGN, great-uncle-great-nephew; C, cousin; HUN, half-uncle-nephew; G2PO, great-grand- 
parent-grandoffspring; C1 R, cousin once removed; HC, half-cousin; G3PO, great-great-grandparent-grandoffspring; 2C, second cousin; HC1 R, half-cousin once 
removed; G4PO, great-great-great-grandparent-grandoffspring; 2C1R, second cousin once removed; H2C, half second cousin; G5PO, great-great-great-great- 
grandparent-grandoffspring; 3C, third cousin; H2C1R, half second cousin once removed; G6PO, great-great-great-great-great-grandparent-grandoffspring. 



Appendix Table A5 SD(A) as Table A2 using information on numbers and position of shared segments 



R 






1/4 






1/8 






1/16 






1/32 






1/64 






1/128 






R 


UN 


HS 


GPO GUGN 


C 


HUN 


G2PO 


C1R 


HC 


G3PO 


2C 


HC1R G4PO 


2C1R 


H2C 


G5PO 


3C 


H2C1R G6PO 


1/4 


UN 


0.00 


2.15 


8.56 


3.62 


3.81 


4.12 


6.91 


6.20 


6.14 


7.26 


8.16 


7.98 


8.43 


9.86 


9.74 


9.76 


11.58 


11.49 


11.08 




HS 


1.85 


0.00 


5.82 


3.91 


4.29 


3.63 


4.94 


6.63 


6.08 


6.25 


8.70 


8.18 


8.06 


10.38 


10.07 


9.85 


12.10 


11.69 


11.41 




GPO 


5.35 


4.17 


0.00 


7.16 


8.12 


6.28 


4.21 


10.71 


9.28 


7.62 


13.08 


11.81 


10.84 


14.31 


13.81 


13.30 


15.64 


14.71 


15.07 


1/8 


GUGN 


5.94 


4.64 


6.24 


0.00 


1.34 


1.17 


4.51 


2.72 


3.30 


5.01 


5.20 


5.51 


6.48 


7.39 


7.60 


8.16 


9.45 


9.60 


9.84 




C 


4.37 


3.50 


6.95 


1.13 


0.00 


1.97 


5.23 


3.07 


3.76 


5.50 


5.30 


5.65 


6.66 


7.26 


7.47 


8.07 


9.09 


9.24 


9.50 




HUN 


6.39 


5.00 


4.65 


0.94 


1.98 


0.00 


2.86 


2.61 


2.74 


3.81 


4.95 


5.03 


5.58 


7.06 


7.14 


7.46 


9.08 


9.09 


9.24 




G2PO 


7.53 


6.34 


3.26 


2.60 


3.68 


2.01 


0.00 


3.71 


3.10 


2.61 


5.57 


5.16 


4.95 


7.35 


7.19 


7.09 


9.18 


8.90 


9.01 


1/16 


C1R 


8.42 


6.98 


5.08 


2.31 


3.56 


2.06 


2.98 


0.00 


1.09 


2.80 


2.41 


2.95 


3.98 


4.49 


4.86 


5.54 


6.42 


6.69 


7.16 




HC 


9.24 


7.86 


4.68 


2.93 


4.34 


2.60 


2.23 


0.98 


0.00 


1.49 


1.88 


2.15 


2.87 


3.89 


4.13 


4.60 


5.81 


5.99 


6.32 




G3PO 


9.51 


8.37 


4.80 


3.44 


4.83 


3.20 


2.18 


1.91 


1.11 


0.00 


2.03 


1.88 


1.95 


3.61 


3.65 


3.80 


5.33 


5.37 


5.54 


1/32 


2C 


9.93 


8.63 


5.05 


4.00 


5.45 


3.75 


2.97 


2.15 


1.50 


1.50 


0.00 


0.66 


1.58 


1.87 


2.26 


2.87 


3.60 


3.90 


4.34 




HC1R 


10.13 


8.94 


5.26 


4.31 


5.74 


4.10 


3.16 


2.55 


1.85 


1.41 


0.59 


0.00 


0.82 


1.44 


1.70 


2.17 


3.10 


3.32 


3.67 




G4PO 


10.01 


9.00 


5.53 


4.46 


5.83 


4.33 


3.42 


2.86 


2.21 


1.62 


1.14 


0.65 


0.00 


1.29 


1.35 


1.55 


2.70 


2.83 


3.04 


1/64 


2C1R 


9.85 


8.82 


5.38 


4.74 


6.09 


4.59 


3.68 


3.32 


2.69 


2.20 


1.54 


1.15 


0.96 


0.00 


0.42 


0.94 


1.45 


1.73 


2.09 




H2C 


9.77 


8.82 


5.54 


4.84 


6.14 


4.72 


3.85 


3.48 


2.89 


2.38 


1.78 


1.38 


1.06 


0.38 


0.00 


0.49 


1.13 


1.33 


1.63 




G5PO 


9.52 


8.69 


5.67 


4.85 


6.07 


4.77 


4.00 


3.57 


3.03 


2.56 


1.99 


1.62 


1.25 


0.73 


0.41 


0.00 


0.92 


1.04 


1.21 


1/128 


3C 


8.97 


8.18 


5.34 


4.84 


5.98 


4.75 


4.00 


3.77 


3.25 


2.81 


2.37 


2.02 


1.72 


1.15 


0.90 


0.70 


0.00 


0.27 


0.58 




H2C1R 


8.80 


8.07 


5.38 


4.83 


5.92 


4.76 


4.07 


3.80 


3.32 


2.91 


2.47 


2.14 


1.84 


1.30 


1.05 


0.82 


0.25 


0.00 


0.31 




G6PO 


8.53 


7.87 


5.40 


4.76 


5.79 


4.72 


4.10 


3.79 


3.35 


2.98 


2.53 


2.23 


1.95 


1.44 


1.20 


0.96 


0.47 


0.26 


0.00 



UN, uncle-nephew; HS, half-sib; GPO, grandparent-grandoffspring; GUGN, great-uncle-great-nephew; C, cousin; HUN, half-uncle-nephew; G2PO, great-grand- 
parent-grandoffspring; C1 R, cousin once removed; HC, half-cousin; G3PO, great-great-grandparent-grandoffspring; 2C, second cousin; HC1 R, half-cousin once 
removed; G4PO, great-great-great-grandparent-grandoffspring; 2C1R, second cousin once removed; H2C, half second cousin; G5PO, great-great-great-great- 
grandparent-grandoffspring; 3C, third cousin; H2C1R, half second cousin once removed; G6PO, great-great-great-great-great-grandparent-grandoffspring. 
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Appendix Table A6 SD(A) as Table A3 using information on numbers, position, and length of shared segments 



K 






1 IA 

I /4 






1/8 






1 /1 L 
I / I O 






1/32 






1 ILA 

I /o4 






1/128 






R 


UN 


HS 


GPO GUGN 


C 


HUN G2PO 


C1R 


HC 


G3PO 


2C 


HC1R G4PO 


2C1R 


H2C 


G5PO 


3C 


H2C1R G6PO 


1/4 


UN 


0.00 


2.16 


8.50 


4.44 


5.40 


4.56 


6.93 


8.16 


7.06 


7.46 


10.28 


9.35 


8.93 


11.50 


11.14 


10.50 


12.15 


1 1 .87 


1 1.52 




HS 


1.86 


0.00 


5.81 


5.1 1 


6.19 


4.51 


5.11 


9.18 


7.62 


6.87 


11.47 


10.27 


9.14 


12.69 


12.22 


11.25 


13.26 


12.88 


12.50 




GPO 


5.34 


4.16 


0.00 


8.26 


9.42 


7.20 


4.77 


12.88 


11.00 


8.74 


15.55 


14.18 


12.35 


16.77 


16.22 


15.22 


17.51 


16.92 


16.78 


1/8 


GUGN 


6.36 


5.11 


6.34 


0.00 


1.51 


1 .26 


4.63 


3.72 


3.55 


5.03 


6.69 


6.23 


6.70 


9.25 


8.81 


8.75 


11.32 


10.87 


10.71 




C 


5.17 


4.43 


7.21 


1.34 


0.00 


2.24 


5.51 


3.51 


3.81 


5.55 


6.18 


5.97 


6.71 


8.52 


8.17 


8.33 


10.43 


10.02 


9.99 




HUN 


6.71 


5.40 


4.93 


1.07 


2.23 


0.00 


2.99 


4.10 


3.31 


3.87 


7.04 


6.23 


6.04 


9.56 


8.91 


8.43 


11.56 


10.96 


10.58 




G2PO 


7.64 


6.49 


3.57 


3.04 


4.17 


2.26 


0.00 


5.94 


4.52 


3.22 


8.75 


7.54 


6.28 


11.08 


10.24 


9.17 


12.91 


12.21 


1 1.51 


1/16 


C1R 


8.04 


6.67 


5.37 


2.57 


3.64 


2.51 


3.69 


0.00 


1.37 


3.24 


2.65 


2.97 


4.06 


5.03 


5.05 


5.58 


7.18 


7.04 


7.30 




HC 


8.86 


7.50 


4.69 


2.98 


4.35 


2.72 


2.64 


1.26 


0.00 


1.70 


2.82 


2.46 


2.90 


5.18 


4.81 


4.84 


7.32 


6.94 


6.84 




G3PO 


9.21 


8.07 


4.71 


3.43 


4.82 


3.21 


2.33 


2.49 


1.39 


0.00 


3.78 


2.95 


2.32 


5.88 


5.22 


4.64 


7.86 


7.29 


6.81 


1/32 


2C 


8.66 


7.36 


4.48 


3.90 


5.32 


3.64 


3.1 3 


2.19 


1.72 


2.08 


0.00 


0.88 


1 .96 


2.01 


2.28 


2.95 


3.89 


3.98 


4.36 




HC1R 


9.07 


7.87 


4.55 


4.23 


5.67 


3.99 


3.11 


2.55 


1.90 


1.68 


0.82 


0.00 


1.02 


2.02 


1.88 


2.20 


3.82 


3.67 


3.79 




G4PO 


9.19 


8.16 


4.87 


4.42 


5.81 


4.27 


3.33 


2.86 


2.20 


1.69 


1.60 


0.89 


0.00 


2.48 


2.02 


1.76 


4.06 


3.72 


3.50 


1/64 


2C1R 


8.20 


7.12 


4.11 


4.39 


5.77 


4.15 


3.21 


3.26 


2.62 


2.20 


1.56 


1.26 


1.32 


0.00 


0.59 


1.24 


1.52 


1.74 


2.16 




H2C 


8.37 


7.38 


4.34 


4.59 


5.93 


4.39 


3.41 


3.45 


2.83 


2.32 


1.78 


1.40 


1.20 


0.56 


0.00 


0.65 


1.45 


1.42 


1.66 




G5PO 


8.39 


7.52 


4.64 


4.70 


5.95 


4.56 


3.66 


3.56 


3.00 


2.50 


2.00 


1.61 


1.28 


1.05 


0.58 


0.00 


1.64 


1.42 


1.33 


1/128 3C 


7.31 


6.47 


3.85 


4.37 


5.56 


4.18 


3.28 


3.63 


3.03 


2.51 


2.33 


1.97 


1.68 


1.15 


0.95 


0.90 


0.00 


0.38 


0.79 




H2C1R 7.36 


6.58 


4.04 


4.48 


5.62 


4.31 


3.45 


3.72 


3.17 


2.66 


2.45 


2.10 


1.79 


1.30 


1.06 


0.89 


0.37 


0.00 


0.43 




G6PO 


7.32 


6.61 


4.24 


4.51 


5.58 


4.39 


3.61 


3.75 


3.25 


2.80 


2.52 


2.21 


1.91 


1.44 


1.19 


0.97 


0.68 


0.40 


0.00 



UN, uncle-nephew; HS, half-sib; GPO, grandparent-grandoffspring; GUGN, great-uncle-great-nephew; C, cousin; HUN, half-uncle-nephew; G2PO, great-grand- 
parent-grandoffspring; C1 R, cousin once removed; HC, half-cousin; G3PO, great-great-grandparent-grandoffspring; 2C, second cousin; HC1 R, half-cousin once 
removed; G4PO, great-great-great-grandparent-grandoffspring; 2C1R, second cousin once removed; H2C, half second cousin; G5PO, great-great-great-great- 
grandparent-grandoffspring; 3C, third cousin; H2C1R, half second cousin once removed; G6PO, great-great-great-great-great-grandparent-grandoffspring. 



Appendix Table A7 E(A)/SD(A) as Tables A1 and A4 using information only on numbers of shared segments 



R 






1/4 






1/8 






1/16 






1/32 






1/64 






1/128 






R 


UN 


HS 


GPO 


GUGN 


C 


HUN 


G2PO 


C1R 


HC 


G3PO 


2C 


HC1R 


G4PO 


2C1R 


H2C 


G5PO 


3C 


H2C1R 


G6PO 


1/4 


UN 


0.00 


0.91 


2.99 


2.38 


1.69 


2.67 


3.57 


3.82 


4.36 


4.79 


5.54 


5.88 


6.07 


7.01 


7.23 


7.31 


8.22 


8.39 


8.45 




HS 


0.96 


0.00 


2.15 


1.83 


1.10 


2.05 


2.89 


3.22 


3.78 


4.20 


4.92 


5.29 


5.51 


6.35 


6.61 


6.72 


7.58 


7.76 


7.82 




GPO 


3.54 


2.43 


0.00 


1.90 


2.03 


1.42 


1.25 


1.89 


2.16 


2.48 


3.01 


3.32 


3.62 


4.17 


4.41 


4.63 


5.17 


5.35 


5.49 


1/8 


GUGN 


1.86 


1.41 


1.53 


0.00 


0.54 


0.46 


1.36 


1.10 


1.54 


2.03 


2.27 


2.55 


2.81 


3.20 


3.38 


3.54 


3.94 


4.06 


4.17 




C 


1.44 


0.92 


1.60 


0.58 


0.00 


0.87 


1.79 


1.73 


2.20 


2.67 


3.04 


3.34 


3.61 


4.10 


4.30 


4.47 


4.97 


5.11 


5.21 




HUN 


2.28 


1.65 


1.23 


0.50 


0.86 


0.00 


0.97 


0.85 


1.28 


1.75 


2.05 


2.33 


2.60 


2.99 


3.18 


3.36 


3.75 


3.88 


3.99 




G2PO 


3.52 


2.68 


1.10 


1.69 


2.04 


1.12 


0.00 


0.79 


0.79 


1.03 


1.43 


1.67 


1.92 


2.31 


2.48 


2.66 


3.03 


3.16 


3.27 


1/16 


C1R 


3.06 


2.47 


1.64 


1.11 


1.61 


0.82 


0.70 


0.00 


0.44 


0.94 


1.13 


1.39 


1.66 


1.98 


2.14 


2.31 


2.63 


2.74 


2.85 




HC 


3.63 


2.98 


1.86 


1.59 


2.10 


1.25 


0.76 


0.45 


0.00 


0.53 


0.71 


0.96 


1.23 


1.55 


1.71 


1.88 


2.19 


2.30 


2.40 




G3PO 


4.47 


3.69 


2.11 


2.30 


2.81 


1.86 


1.00 


1.07 


0.59 


0.00 


0.46 


0.62 


0.83 


1.18 


1.34 


1.50 


1.81 


1.92 


2.02 


1/32 


2C 


4.61 


3.93 


2.58 


2.34 


2.91 


2.02 


1.39 


1.15 


0.72 


0.45 


0.00 


0.27 


0.56 


0.83 


0.99 


1.16 


1.44 


1.54 


1.64 




HC1R 


5.10 


4.37 


2.85 


2.69 


3.29 


2.35 


1.64 


1.46 


0.99 


0.62 


0.27 


0.00 


0.31 


0.58 


0.74 


0.91 


1.19 


1.30 


1.40 




G4PO 


5.78 


4.94 


3.16 


3.18 


3.82 


2.78 


1.92 


1.85 


1.35 


0.85 


0.62 


0.34 


0.00 


0.37 


0.51 


0.66 


0.97 


1.07 


1.17 


1/64 


2C1R 


6.31 


5.48 


3.70 


3.49 


4.19 


3.14 


2.35 


2.15 


1.66 


1.23 


0.88 


0.60 


0.38 


0.00 


0.17 


0.35 


0.63 


0.73 


0.83 




H2C 


6.77 


5.89 


3.97 


3.79 


4.53 


3.42 


2.57 


2.38 


1.87 


1.41 


1.07 


0.78 


0.53 


0.18 


0.00 


0.19 


0.47 


0.57 


0.68 




G5PO 


7.39 


6.42 


4.29 


4.19 


4.97 


3.78 


2.83 


2.69 


2.14 


1.63 


1.31 


1.00 


0.70 


0.39 


0.21 


0.00 


0.32 


0.42 


0.52 


1/128 


3C 


8.26 


7.24 


4.96 


4.68 


5.56 


4.29 


3.33 


3.10 


2.53 


2.03 


1.64 


1.34 


1.05 


0.67 


0.49 


0.33 


0.00 


0.11 


0.23 




H2C1R 


8.71 


7.64 


5.23 


4.94 


5.87 


4.53 


3.54 


3.30 


2.71 


2.19 


1.80 


1.48 


1.18 


0.80 


0.61 


0.44 


0.12 


0.00 


0.12 




G6PO 


9.30 


8.15 


5.54 


5.30 


6.28 


4.85 


3.77 


3.56 


2.94 


2.37 


1.98 


1.65 


1.32 


0.94 


0.75 


0.55 


0.24 


0.13 


0.00 



UN, uncle-nephew; HS, half-sib; GPO, grandparent-grandoffspring; GUGN, great-uncle-great-nephew; C, cousin; HUN, half-uncle-nephew; G2PO, great-grand- 
parent-grandoffspring; C1 R, cousin once removed; HC, half-cousin; G3PO, great-great-grandparent-grandoffspring; 2C, second cousin; HC1 R, half-cousin once 
removed; G4PO, great-great-great-grandparent-grandoffspring; 2C1R, second cousin once removed; H2C, half second cousin; G5PO, great-great-great-great- 
grandparent-grandoffspring; 3C, third cousin; H2C1R, half second cousin once removed; G6PO, great-great-great-great-great-grandparent-grandoffspring. 
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Appendix Table A8 E(A)/SD(A) as Tables A2 and A5 using information on numbers and position of shared segments 



R 






1/4 






1/8 






1/16 






1/32 






1/64 






1/128 






R 


UN 


HS 


GPO 


GUGN 


C 


HUN 


G2PO 


C1R 


HC 


G3PO 


2C 


HC1R 


G4PO 


2C1R 


H2C 


G5PO 


3C 


H2C1R 


G6PO 


1/4 


UN 


0.00 


0.97 


3.27 


2.47 


2.04 


2.78 


3.59 


3.84 


4.38 


4.90 


5.24 


5.66 


6.10 


6.37 


6.68 


7.12 


7.24 


7.44 


7.95 




HS 


1.02 


0.00 


2.37 


2.05 


1.80 


2.20 


2.94 


3.25 


3.69 


4.27 


4.45 


4.84 


5.32 


5.51 


5.79 


6.18 


6.34 


6.60 


6.92 




GPO 


3.83 


2.66 


0.00 


2.55 


2.76 


2.24 


1.78 


2.80 


2.77 


2.79 


3.29 


3.38 


3.47 


3.92 


4.00 


4.09 


4.52 


4.67 


4.65 


1/8 


GUGN 


2.12 


1.95 


2.64 


0.00 


0.60 


0.51 


1.54 


1.24 


1.57 


2.03 


2.34 


2.57 


2.83 


3.19 


3.36 


3.55 


3.84 


3.96 


4.13 




C 


1.95 


1.94 


2.93 


0.64 


0.00 


0.99 


2.06 


1.75 


2.20 


2.69 


3.02 


3.32 


3.60 


3.99 


4.21 


4.43 


4.75 


4.90 


5.12 




HUN 


2.42 


2.00 


2.44 


0.55 


0.99 


0.00 


1.12 


1.11 


1.35 


1.76 


2.13 


2.36 


2.62 


2.96 


3.14 


3.33 


3.61 


3.74 


3.91 




G2PO 


3.54 


2.74 


1.94 


1.84 


2.31 


1.26 


0.00 


1.36 


1.18 


1.16 


1.76 


1.85 


1.98 


2.38 


2.49 


2.62 


2.93 


3.06 


3.14 


1/16 


C1R 


3.56 


3.26 


3.52 


1.31 


1.67 


1.21 


1.46 


0.00 


0.51 


1.08 


1.14 


1.39 


1.66 


1.98 


2.14 


2.31 


2.62 


2.72 


2.84 




HC 


3.94 


3.48 


3.49 


1.64 


2.11 


1.37 


1.32 


0.53 


0.00 


0.61 


0.81 


0.99 


1.24 


1.59 


1.73 


1.88 


2.18 


2.28 


2.39 




G3PO 


4.62 


3.96 


3.29 


2.30 


2.83 


1.87 


1.24 


1.23 


0.67 


0.00 


0.78 


0.77 


0.87 


1.31 


1.40 


1.53 


1.84 


1.93 


2.02 


1/32 


2C 


5.17 


4.70 


4.60 


2.57 


3.02 


2.35 


2.19 


1.19 


0.88 


0.86 


0.00 


0.31 


0.63 


0.83 


0.99 


1.16 


1.44 


1.54 


1.64 




HC1R 


5.52 


4.96 


4.54 


2.81 


3.34 


2.54 


2.20 


1.46 


1.04 


0.85 


0.32 


0.00 


0.35 


0.62 


0.75 


0.91 


1.21 


1.30 


1.40 




G4PO 


6.08 


5.38 


4.49 


3.23 


3.83 


2.87 


2.26 


1.86 


1.35 


0.93 


0.71 


0.38 


0.00 


0.52 


0.57 


0.68 


1.02 


1.10 


1.18 


1/64 


2C1R 


6.93 


6.30 


5.64 


3.75 


4.34 


3.48 


3.05 


2.20 


1.80 


1.55 


0.89 


0.67 


0.58 


0.00 


0.20 


0.40 


0.63 


0.73 


0.83 




H2C 


7.30 


6.59 


5.67 


3.97 


4.62 


3.67 


3.12 


2.41 


1.96 


1.63 


1.07 


0.81 


0.63 


0.20 


0.00 


0.22 


0.49 


0.58 


0.67 




G5PO 


7.82 


7.00 


5.73 


4.30 


5.02 


3.94 


3.24 


2.70 


2.18 


1.75 


1.31 


1.01 


0.73 


0.44 


0.23 


0.00 


0.38 


0.44 


0.52 


1/128 


3C 


8.98 


8.14 


6.88 


4.97 


5.74 


4.64 


3.99 


3.18 


2.69 


2.31 


1.67 


1.40 


1.19 


0.68 


0.53 


0.42 


0.00 


0.13 


0.25 




H2C1R 


9.36 


8.46 


6.99 


5.18 


6.00 


4.83 


4.10 


3.35 


2.82 


2.40 


1.81 


1.52 


1.27 


0.80 


0.63 


0.48 


0.13 


0.00 


0.14 




G6PO 


9.88 


8.87 


7.11 


5.47 


6.37 


5.08 


4.24 


3.58 


3.01 


2.52 


1.98 


1.66 


1.38 


0.95 


0.75 


0.57 


0.27 


0.15 


0.00 



UN, uncle-nephew; HS, half-sib; GPO, grandparent-grandoffspring; GUGN, great-uncle-great-nephew; C, cousin; HUN, half-uncle-nephew; G2PO, great-grand- 
parent-grandoffspring; C1 R, cousin once removed; HC, half-cousin; G3PO, great-great-grandparent-grandoffspring; 2C, second cousin; HC1 R, half-cousin once 
removed; G4PO, great-great-great-grandparent-grandoffspring; 2C1R, second cousin once removed; H2C, half second cousin; G5PO, great-great-great-great- 
grandparent-grandoffspring; 3C, third cousin; H2C1R, half second cousin once removed; G6PO, great-great-great-great-great-grandparent-grandoffspring. 



Appendix Table A9 E(A)/SD(A) as Tables A3 and A6 using information on numbers, position, and length of shared segments 



R 






1/4 






1/8 






1/16 






1/32 






1/64 






1/128 






R 


UN 


HS 


GPO 


GUGN 


C 


HUN 


G2PO 


C1R 


HC 


G3PO 


2C 


HC1R 


G4PO 


2C1R 


H2C 


G5PO 


3C 


H2C1R 


G6PO 


1/4 


UN 


0.00 


0.98 


3.28 


2.95 


2.75 


3.18 


3.76 


4.69 


5.04 


5.40 


6.23 


6.44 


6.86 


7.71 


7.71 


8.03 


9.06 


9.05 


9.19 




HS 


1.03 


0.00 


2.37 


2.62 


2.62 


2.66 


3.17 


4.24 


4.37 


4.77 


5.60 


5.67 


6.02 


6.92 


6.84 


7.03 


8.11 


8.07 


8.09 




GPO 


3.84 


2.66 


0.00 


3.13 


3.47 


2.73 


2.09 


3.96 


3.64 


3.39 


4.67 


4.47 


4.31 


5.42 


5.25 


5.08 


6.05 


5.98 


5.79 


1/8 


GUGN 


2.64 


2.63 


3.27 


0.00 


0.70 


0.56 


1.68 


1.50 


1.64 


2.03 


2.57 


2.68 


2.87 


3.41 


3.48 


3.62 


4.09 


4.14 


4.23 




C 


2.80 


2.94 


3.67 


0.73 


0.00 


1.13 


2.28 


1.86 


2.22 


2.69 


3.15 


3.39 


3.62 


4.12 


4.29 


4.48 


4.93 


5.06 


5.20 




HUN 


2.82 


2.51 


3.02 


0.60 


1.13 


0.00 


1.22 


1.52 


1.49 


1.77 


2.47 


2.52 


2.69 


3.26 


3.30 


3.43 


3.91 


3.96 


4.03 




G2PO 


3.72 


2.97 


2.30 


1.92 


2.50 


1.34 


0.00 


1.94 


1.57 


1.33 


2.39 


2.26 


2.21 


2.94 


2.87 


2.86 


3.43 


3.41 


3.40 


1/16 


C1R 


4.79 


4.75 


4.97 


1.71 


1.85 


1.80 


2.20 


0.00 


0.65 


1.32 


1.19 


1.39 


1.68 


2.03 


2.15 


2.30 


2.66 


2.75 


2.84 




HC 


4.76 


4.46 


4.77 


1.75 


2.14 


1.60 


1.86 


0.66 


0.00 


0.73 


1.03 


1.05 


1.23 


1.73 


1.79 


1.90 


2.29 


2.35 


2.42 




G3PO 


5.18 


4.60 


4.19 


2.31 


2.85 


1.90 


1.49 


1.43 


0.78 


0.00 


1.21 


1.02 


0.95 


1.63 


1.59 


1.61 


2.07 


2.07 


2.10 


1/32 


2C 


6.82 


6.62 


6.99 


3.11 


3.35 


3.10 


3.34 


1.28 


1.23 


1.46 


0.00 


0.42 


0.83 


0.86 


0.99 


1.17 


1.46 


1.55 


1.64 




HC1R 


6.79 


6.40 


6.57 


3.09 


3.49 


2.96 


3.05 


1.47 


1.16 


1.24 


0.43 


0.00 


0.46 


0.74 


0.78 


0.91 


1.28 


1.33 


1.40 




G4PO 


7.07 


6.47 


5.97 


3.33 


3.87 


3.06 


2.76 


1.90 


1.37 


1.07 


0.88 


0.48 


0.00 


0.81 


0.72 


0.72 


1.18 


1.19 


1.22 


1/64 


2C1R 


9.05 


8.65 


8.87 


4.44 


4.82 


4.37 


4.50 


2.37 


2.20 


2.29 


0.95 


0.89 


1.01 


0.00 


0.28 


0.54 


0.64 


0.73 


0.84 




H2C 


9.06 


8.52 


8.32 


4.42 


4.93 


4.27 


4.19 


2.48 


2.17 


2.11 


1.09 


0.88 


0.87 


0.29 


0.00 


0.30 


0.55 


0.59 


0.67 




G5PO 


9.29 


8.57 


7.78 


4.56 


5.19 


4.31 


3.96 


2.72 


2.26 


2.01 


1.35 


1.02 


0.82 


0.57 


0.31 


0.00 


0.55 


0.52 


0.54 


1/128 


3C 


11.56 


10.93 


10.59 


5.79 


6.37 


5.65 


5.56 


3.41 


3.13 


3.08 


1.76 


1.62 


1.62 


0.72 


0.65 


0.70 


0.00 


0.19 


0.35 




H2C1R 


11.63 


10.88 


10.12 


5.79 


6.47 


5.60 


5.33 


3.49 


3.12 


2.96 


1.85 


1.63 


1.54 


0.82 


0.67 


0.63 


0.19 


0.00 


0.20 




G6PO 


11.86 


10.96 


9.69 


5.90 


6.68 


5.63 


5.17 


3.65 


3.18 


2.89 


2.00 


1.71 


1.52 


0.98 


0.77 


0.62 


0.37 


0.21 


0.00 



UN, uncle-nephew; HS, half-sib; GPO, grandparent-grandoffspring; GUGN, great-uncle-great-nephew; C, cousin; HUN, half-uncle-nephew; G2PO, great-grand- 
parent-grandoffspring; C1 R, cousin once removed; HC, half-cousin; G3PO, great-great-grandparent-grandoffspring; 2C, second cousin; HC1 R, half-cousin once 
removed; G4PO, great-great-great-grandparent-grandoffspring; 2C1R, second cousin once removed; H2C, half second cousin; G5PO, great-great-great-great- 
grandparent-grandoffspring; 3C, third cousin; H2C1R, half second cousin once removed; G6PO, great-great-great-great-great-grandparent-grandoffspring. 
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Appendix Table A10 E(A)/SD(A) evaluated using a model of 22 chromosomes, each of length 1.632 M, using information as Table A9 
on numbers, position, and length of segments 



K 






1 IA 
I/4 






1/8 






1 /1 A 






I / jZ 






1 /AA 
I /o4 






1 /1 9P 
I / I Zo 






R 


UN 


HS 


GPO 


GUGN 


C 


HUN 


G2PO 


C1R 


HC 


G3PO 


2C 


HC1R 


G4PO 


2C1R 


H2C 


G5PO 


3C 


H2C1R 


G6PO 


1/4 


UN 


0.00 


0.97 


3.25 


2.97 


2.78 


3.19 


3.75 


4.70 


5.04 


5.35 


6.18 


6.43 


6.80 


7.46 


7.67 


7.95 


8.79 


8.75 


8.98 




HS 


1.03 


0.00 


2.37 


2.65 


2.65 


2.67 


3.17 


4.26 


4.39 


4.77 


5.57 


5.63 


6.01 


6.68 


6.78 


7.00 


7.92 


7.82 


7.86 




GPO 


3.83 


2.66 


0.00 


3.15 


3.47 


2.74 


2.09 


3.92 


3.62 


3.39 


4.64 


4.35 


4.31 


5.33 


5.15 


4.98 


6.00 


5.85 


5.65 


1/8 


GUGN 


2.66 


2.65 


3.25 


0.00 


0.72 


0.56 


1.68 


1.51 


1.64 


2.00 


2.56 


2.69 


2.85 


3.37 


3.48 


3.60 


4.04 


4.10 


4.20 




C 


2.84 


2.96 


3.63 


0.75 


0.00 


1.13 


2.28 


1.88 


2.24 


2.68 


3.15 


3.42 


3.62 


4.12 


4.33 


4.50 


4.91 


5.04 


5.20 




HUN 


2.83 


2.53 


3.01 


0.59 


1.14 


0.00 


1.22 


1.54 


1.50 


1.76 


2.47 


2.53 


2.69 


3.23 


3.31 


3.43 


3.88 


3.93 


4.00 




G2PO 


3.72 


2.97 


2.29 


1.92 


2.50 


1.33 


0.00 


1.94 


1.57 


1.32 


2.39 


2.25 


2.21 


2.89 


2.85 


2.85 


3.40 


3.38 


3.35 


1/16 


C1R 


4.83 


4.81 


4.94 


1.72 


1.87 


1.82 


2.21 


0.00 


0.65 


1.32 


1.19 


1.40 


1.67 


2.03 


2.17 


2.30 


2.66 


2.76 


2.85 




HC 


4.79 


4.50 


4.77 


1.74 


2.16 


1.61 


1.87 


0.67 


0.00 


0.72 


1.04 


1.06 


1.22 


1.73 


1.80 


1.89 


2.29 


2.35 


2.42 




G3PO 


5.17 


4.59 


4.17 


2.30 


2.85 


1.88 


1.47 


1.43 


0.77 


0.00 


1.23 


1.04 


0.96 


1.64 


1.61 


1.62 


2.08 


2.09 


2.11 


1/32 


2C 


6.85 


6.67 


6.97 


3.11 


3.37 


3.11 


3.35 


1.27 


1.23 


1.48 


0.00 


0.41 


0.83 


0.86 


0.99 


1.16 


1.47 


1.55 


1.64 




HC1R 


6.83 


6.46 


6.57 


3.09 


3.52 


2.97 


3.06 


1.47 


1.16 


1.26 


0.42 


0.00 


0.45 


0.74 


0.78 


0.89 


1.28 


1.34 


1.40 




G4PO 


7.05 


6.46 


5.96 


3.31 


3.87 


3.04 


2.74 


1.89 


1.34 


1.07 


0.88 


0.47 


0.00 


0.81 


0.73 


0.72 


1.19 


1.21 


1.23 


1/64 


2C1R 


9.03 


8.67 


8.84 


4.42 


4.82 


4.36 


4.49 


2.36 


2.19 


2.30 


0.93 


0.87 


1.01 


0.00 


0.27 


0.53 


0.64 


0.73 


0.84 




H2C 


9.09 


8.57 


8.34 


4.41 


4.95 


4.28 


4.20 


2.48 


2.16 


2.13 


1.08 


0.87 


0.88 


0.28 


0.00 


0.28 


0.54 


0.59 


0.67 




G5PO 


9.26 


8.56 


7.78 


4.52 


5.17 


4.29 


3.94 


2.69 


2.23 


2.01 


1.33 


0.99 


0.80 


0.56 


0.29 


0.00 


0.56 


0.54 


0.55 


1/128 


3C 


11.57 


10.98 


10.62 


5.78 


6.38 


5.65 


5.56 


3.40 


3.12 


3.10 


1.75 


1.60 


1.62 


0.71 


0.63 


0.70 


0.00 


0.18 


0.35 




H2C1R 


11.71 


10.98 


10.18 


5.81 


6.52 


5.63 


5.36 


3.50 


3.13 


3.00 


1.85 


1.63 


1.56 


0.82 


0.66 


0.64 


0.18 


0.00 


0.19 




G6PO 


11.87 


10.99 


9.71 


5.88 


6.69 


5.63 


5.17 


3.64 


3.17 


2.91 


1.99 


1.69 


1.53 


0.97 


0.75 


0.62 


0.36 


0.19 


0.00 



UN, uncle-nephew; HS, half-sib; GPO, grandparent-grandoffspring; GUGN, great-uncle-great-nephew; C, cousin; HUN, half-uncle-nephew; G2PO, great-grand- 
parent-grandoffspring; C1 R, cousin once removed; HC, half-cousin; G3PO, great-great-grandparent-grandoffspring; 2C, second cousin; HC1 R, half-cousin once 
removed; G4PO, great-great-great-grandparent-grandoffspring; 2C1R, second cousin once removed; H2C, half second cousin; G5PO, great-great-great-great- 
grandparent-grandoffspring; 3C, third cousin; H2C1R, half second cousin once removed; G6PO, great-great-great-great-great-grandparent-grandoffspring. 



Appendix Table A11 Skew coefficient of log likelihood ratio for a subset of relationships for the model in Table A10 
of 22 chromosomes, each of 1.632 M, using information on numbers, positions, and lengths of shared segments 



R 






1/4 






1/8 






1/32 






1/128 






R 


UN 


HS 


GPO 


C 


HUN 


G2PO 


2C 


HC1R 


G4PO 


3C 


H2C1R 


G6PO 


1/4 


UN 




0.15 


0.11 


-0.03 


-0.21 


-0.01 


-0.09 


-0.13 


-0.17 


-0.17 


-0.16 


-0.17 




HS 


-0.16 




0.12 


0.05 


-0.13 


-0.13 


-0.06 


-0.07 


-0.16 


-0.16 


-0.15 


-0.14 




GPO 


-0.20 


-0.18 




0.09 


0.09 


0.04 


0.02 


0.06 


0.02 


-0.08 


-0.06 


-0.03 


1/8 


C 


-0.09 


-0.20 


0.08 




-0.02 


0.10 


-0.05 


-0.13 


-0.09 


-0.06 


-0.08 


-0.11 




HUN 


0.02 


-0.01 


-0.02 


-0.06 




0.26 


0.15 


0.08 


0.02 


0.05 


0.04 


0.02 




G2PO 


-0.02 


-0.01 


-0.23 


-0.08 


-0.14 




0.23 


0.25 


0.16 


0.11 


0.12 


0.12 


1/32 


2C 


-0.25 


-0.28 


-0.42 


-0.20 


-0.38 


-0.46 




0.19 


0.32 


0.21 


0.18 


0.18 




HC1R 


-0.23 


-0.24 


-0.45 


-0.18 


-0.27 


-0.46 


-0.08 




0.45 


0.31 


0.24 


0.20 




G4PO 


-0.26 


-0.27 


-0.43 


-0.22 


-0.27 


-0.38 


0.02 


-0.10 




0.47 


0.40 


0.33 


1/128 


3C 


-0.51 


-0.53 


-0.78 


-0.48 


-0.56 


-0.72 


-0.48 


-0.60 


-0.85 




0.61 


0.59 




H2C1R 


-0.57 


-0.58 


-0.76 


-0.53 


-0.59 


-0.69 


-0.51 


-0.57 


-0.71 


-0.15 




0.65 




G6PO 


-0.59 


-0.60 


-0.73 


-0.56 


-0.60 


-0.66 


-0.55 


-0.56 


-0.62 


-0.07 


0.02 





UN, uncle-nephew; HS, half-sib; GPO, grandparent-grandoffspring; C, cousin; HUN, half-uncle-nephew; G2PO, great-grandparent-grandoffspring; 2C, second 
cousin; HC1 R, half-cousin once removed; G4PO, great-great-great-grandparent-grandoffspring; 3C, third cousin; H2C1 R, half second cousin once removed; G6PO, 
great-great-great-great-great-grandparent-grandoffspring. 
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Appendix Table A12 Kurtosis coefficient of log likelihood ratio, otherwise as Table A11 



K 






1 IA 

I /4 






I /o 






1 /QO 

I / 6Z. 






1 /1 OQ 
I / I ZO 






R 


UN 


HS 


GPO 


C 


HUN 


G2PO 


2C 


HC1R 


G4PO 


3C 


H2C1R 


G6PO 


1/4 


UN 




0.04 


0.02 


0.02 


0.09 


0.03 


0.02 


0.04 


0.08 


0.04 


0.03 


0.04 




HS 


0.00 




0.00 


-0.05 


0.01 


0.07 


-0.05 


-0.02 


0.03 


0.00 


0.00 


0.03 




GPO 


0.03 


0.03 




-0.02 


-0.04 


-0.05 


-0.06 


-0.05 


-0.05 


-0.06 


-0.05 


-0.05 


1/8 


C 


0.01 


0.07 


0.11 




0.00 


0.05 


-0.01 


-0.02 


-0.02 


-0.01 


-0.01 


-0.01 




HUN 


0.00 


-0.01 


0.07 


-0.03 




0.11 


0.02 


0.00 


0.01 


-0.02 


-0.01 


-0.01 




G2PO 


-0.01 


-0.01 


0.04 


0.11 


0.15 




0.05 


0.07 


0.01 


-0.02 


0.00 


-0.01 


1/32 


2C 


-0.04 


0.00 


0.36 


-0.08 


0.09 


0.40 




0.08 


0.12 


-0.07 


-0.09 


-0.10 




HC1R 


-0.02 


-0.01 


0.21 


-0.05 


0.01 


0.21 


0.24 




0.53 


0.02 


-0.01 


-0.04 




G4PO 


-0.03 


-0.03 


0.14 


-0.06 


-0.03 


0.08 


0.45 


0.57 




0.17 


0.10 


0.04 


1/128 


3C 


0.22 


0.25 


0.72 


0.16 


0.30 


0.59 


0.17 


0.37 


0.87 




1.42 


0.75 




H2C1R 


0.32 


0.33 


0.67 


0.27 


0.35 


0.52 


0.26 


0.31 


0.54 


1.64 




1.55 




G6PO 


0.32 


0.32 


0.53 


0.28 


0.32 


0.42 


0.31 


0.29 


0.35 


1.13 


2.49 





UN, uncle-nephew; HS, half-sib; GPO, grandparent-grandoffspring; C, cousin; HUN, half-uncle-nephew; G2PO, great-grandparent-grandoffspring; 2C, second 
cousin; HC1R, half-cousin once removed; G4PO, great-great-great-grandparent-grandoffspring; 3C, third cousin; H2C1R, half second cousin once removed; G6PO, 
great-great-great-great-great-grandparent-grandoffspring. 



Appendix Table A13 Influence on E(A)/SD(A) of chromosome number vs. length, (72 x 0.5 M or 12 x 3M) for total 36 M, using 
information only on number of shared segments (n s ) or using their numbers, position, and length n s , p s , and t s 

Analysis using only n s Analysis using n s , p s , and t s 

R 1/4 1/32 1/4 1/32 



R HS HC1R HS HC1R 

Number of chromosomes 







72 


12 


72 


12 


72 

E(A)/SD(A) 


12 


72 


12 


1/4 


UN 


0.82 


0.94 


7.65 


5.11 


0.97 


0.98 


7.69 


6.35 




HS 


0.00 


0.00 


6.84 


4.52 


0.00 


0.00 


6.97 


5.38 




GPO 


2.07 


2.53 


5.00 


2.77 


2.68 


2.66 


5.93 


4.34 


1/8 


C 


2.01 


1.42 


3.86 


3.20 


3.60 


2.60 


3.89 


3.28 




HUN 


2.73 


1.29 


3.06 


2.14 


3.27 


2.26 


3.22 


2.34 




G2PO 


3.57 


2.41 


2.45 


1.45 


3.65 


2.77 


2.95 


2.08 


1/32 


2C 


6.40 


3.17 


0.27 


0.28 


8.64 


5.86 


0.47 


0.41 




HC1R 


6.89 


3.61 


0.00 


0.00 


8.42 


5.75 


0.00 


0.00 




G4PO 


7.33 


4.17 


0.29 


0.32 


8.32 


5.85 


0.50 


0.45 


1/128 


3C 


11.04 


6.10 


1.71 


1.22 


14.36 


9.75 


2.01 


1.48 




H2C1R 


11.58 


6.47 


1.88 


1.36 


14.38 


9.77 


2.05 


1.50 




G6PO 


11.92 


6.98 


1.99 


1.54 


14.14 


9.94 


2.07 


1.60 



HS, half-sib; HC1R, half-cousin once removed; UN, uncle-nephew; GPO, grandparent-grandoffspring; C, cousin; HUN, half-uncle-nephew; G2PO, great-grandparent- 
grandoffspring; 2C, second cousin; G4PO, great-great-great-grandparent-grandoffspring; 3C, third cousin; H2C1R, half second cousin once removed; G6PO, great- 
great-great-great-great-grandparent-grandoffspring. 



Simulated data in supplementary files. 
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